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Let  G,  G'  be  directed  graphs.  A  combinatorial  embedding  of  G  into  G' 
is  an  identification  of  each  x  e  V(G)  with  a  set  of  vertices  S  c  V(G') 
such  that  each  S  is  bounded  in  size  by  a  constant  independent  of  |V(G)| 
and  each  arc  in  G  is  carried  into  a  directed  path  of  length  bounded  by  a 
constant  independent  of  |V(G)|.  This  concept  (first  defined  in  [A])  has 
formed  the  basis  for  a  number  of  theoretical  studies  supported  by  ARO 
Contract  No.  DAAG29-76-G-0338,  and  the  papers  collected  here  are  representative 
of  -  with  one  major  exception  -  the  state-of-the-art  with  regard  to  graph 
embeddings. 

First,  a  word  regarding  the  subject  matter  of  these  papers.  By 
modelling  the  control  structures  of  programs  as  classes  of  directed  graphs, 
asymptotic  properties  of  control  structure  transformations  can  be  obtained. 

This  is  the  principle  aim  of  [A,B,C].  Knuth  [1]  surveys  a  number  of  results 
concerning  control  structure  transformations  and  places  the  graph  embedding 
results  in  context.  Directed  graphs  also  model  data  storage  structures 
(vertices  model  nodes  or  records,  arcs  model  logical  adjacencies).  The  notion 
of  graph  embedding  can  be  used  to  compare  the  relative  storage  efficiencies 
of  classes  of  data  structures  [D,E,F].  Several  researchers  have  attempted  to 
generalize  these  results  to  more  encompassing  notions  of  data  storage  and 
representation  (see  e.g.,  [2,7])  and  more  sophisticated  types  of  analysis  [3]. 
The  purely  combinatorial  notions  involved  in  data  structure  embeddings  also 
make  contact  with  a  variety  of  other  theoretical  and  numerical  problems  [4]. 

In  fact,  one  of  the  principle  devices  used  in  the  results  of  [A-F]  is  the 
notion  of  "cutting"  graphs  along  boundaries  of  connected  regions.  A  boundary 
which  cuts  a  graph  is  called  a  separator  and  in  [G,H]  a  characterization  of 
separator  graphs  is  derived  and  used  to  obtain  results  in  areas  from  Turing 
Machine  complexity  to  optimization  theory. 


Graph  Embeddings,  boundary  arguments  and  graph  theoretical  models  of 
computation  all  appear  to  be  related  in  sometimes  surprising  ways  [5, 6, 7, 8]. 
Missing  from  the  collection  is  a  coherent  account  of  these  connections.  It 
will  have  to  suffice  that  the  connections  run  deeper  than  the  surface.  We 
anticipate  reporting  on  this  aspect  of  graph  embedding  elsewhere. 
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ABSimAcr.  Control  structures  and  data  structures  are  modeled  by  directed  graphs.  In  the  control  case  nodes 
represent  executable  statements  and  arcs  represent  possible  flow  of  control:  in  the  data  case  nodes  represent 
memory  locations  and  arcs  represent  logical  adjacencies  in  the  data  structure  Classes  of  graphs  are  compared 
by  a  relation  s  S  T  where  C  s ,  T  H  if  C  can  be  embedded  in  H  with  at  most  a  J -(old  increase  in  distance  between 
embedded  nodes  by  making  at  most  S  “copies"  of  any  node  in  G  For  both  control  structures  and  data 
structures.  S  and  T  are  interpreted  as  space  and  time  constants,  respectively.  Results  are  presented  that 
establish  hierarchies  with  respect  to  ss.r  f»t  (I)  data  structures.  (2)  sequential  program  schemata  normal 
forms,  and  (3)  sequential  control  structures. 

set  woaos  a  no  meases,  ancestor  tree,  bounded  simulation,  complexity,  control  structure,  data  structure, 
directed  graph,  do  forever  program,  embedding,  go  to  proj  m.  label  exit  program,  normal  form  programs, 
structured  programming,  while  programs 

ca  categoxies:  4.22.  4.34.  5.24,  5.25.  5.32 


1 .  Introduction 

The  running  time  or  computational  complexity  of  a  sequential  process  is  usually  esti¬ 
mated  by  summing  weights  attached  to  the  basic  operations  from  which  the  process  is 
derived.  In  practice,  however,  the  complexity  of  a  program  is  often  limited  by  how 
efficiently  it  can  access  its  data  structures  and  control  program  flow.  Furthermore,  it  has 
been  extensively  argued  |4|  that  certain  limitations  on  the  process  sequencing  mecha¬ 
nisms  available  to  the  programmer  result  in  more  “efficient”  representations  for  the 
underlying  processes.  In  this  paper  we  examine  these  issues  in  an  attempt  to  assess  the 
“power"  of  various  data  and  control  structures. 

A  key  observation  about  sequential  processes  is  that  they  usually  do  not  reference 
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their  data  randomly.  For  instance ,  algorithms  that  organize  their  data  structures  as  arra>s 
often  access  the  array  elements  in  a  '•local"  manner  (e.g.  the  conventional  matrix 
multiplication  algorithm  accesses  its  arrays  by  rows  and  by  columns).  Thus  in  a  paging 
environment  how  one  stores  an  array  is  especially  important  (cf.  Moler  1 13],  Rosenberg 
(16)).  and  it  is  natural  to  investigate  how  arrays  can  be  stored  so  that  elements  “near" 
one  another  in  the  array  are  stored  near  one  another  in  memory.  Data  structures  are 
compared  by  the  relation  s,.r:  For  data  structures  G  and  G*.  G  s,,r  G*  if  G  can  be 
embedded  in  G*  so  that  there  is  at  most  a  T-fold  increase  in  distance  between  embedded 
objects. 

It  is  somewhat  unexpected  that  an  analogous  study  for  control  structures  uses  the  same 
basic  insights.  It  is  well  known  that  process  sequencing  disciplines  found  in  programming 
practice  (e.g.  go  to,  while)  can  simulate  each  other  and  are  thus  equivalent  in  the  sense  of 
yielding  functionally  equivalent  programs,  but  are  inequivalent  relative  to  the  stronger 
requirement  of  structural  isomorphism  [1-3,  10.  11].  We  argue  that  the  fundamental 
issue  is  neither  the  construction  of  functionally  equivalent  programs  nor  the  inability  to 
preserve  structure  exactly,  but  rather  the  "naturalness"  of  the  simulation.  Control 
structures  are  compared  by  the  relation  ss.t-  For  algorithms  G  and  G*  with  distinct 
process  sequencing  mechanisms.  G  s  v  rG*  if  G*  simulates  G  by  making  at  most  S  copies 
of  each  operation  in  G  and  increasing  the  cost  of  sequential  access  of  embedded 
operations  by  a  factor  of  at  most  T. 

Thus  comparing  the  power  of  data  structures  and  control  structures  involves  analyzing 
the  one-one  and  manv-one  aspects  of  embedding  (or  simulation)  techniques  whose 
efficiency  is  bounded  by  5  and  T.  In  a  natural  way.  the  relation  ss.t  represents  an 
intertwining  of  space  and  time  complexities. 

In  Section  2  basic  combinatorial  definitions  are  presented,  and  the  combinatorial 
models  used  for  representing  data  structures  and  control  structures  are  introduced.  In 
Section  3  the  relation  sST  is  defined  by  means  of  graph  embeddings.  This  relation  is 
viewed  as  an  embedding  in  the  data  structure  case  and  as  a  simulation  relation  in  the 
control  structure  case.  Section  4  contains  the  main  result  for  data  structure  embeddings: 
For  certain  families  of  structures  {G.J^o  and  {G,*],>n.  if  G,  s1TG,*,  then  T  a;  c-  logo,  (sec 
footnote  I )  for  some  positive  constant  r  whose  choice  is  independent  of  n, ,  the  number  of 
components  of  G,. 

The  main  theorem  in  Section  5  generalizes  the  result  in  Section  4  by  allowing  S  2  1.  In 
this  case,  if  G,  ^.v.rG*  for  certain  natural  choices  of  (G,},,0  and  (G,'},*0.  then  T  +  log  S  2- 
c  log  where  n,  is  the  number  of  components  of  G,  and  c  is  a  positive  constant 
independent  of  n,.  A  direct  result  of  this  theorem  is  that  certain  schema  constructions, 
such  as  Engeler  normal  form  [6|,  cannot  be  achieved  “uniformly’’  with  respect  to  the 
sS  T  relation.  More  exactly,  for  any  constants  S  and  T  there  is  a  go  to  program  G  such 
that  for  no  program  H  in  Engeler  normal  form  is  G  sST  H .  Thus,  the  construction  of 
Engeler  normal  forms -while  always  possible  -  does  not  preserve  time  and  space  in  a 
bounded  way.  This  result  also  demonstrates  how  our  results  will  be  asymptotic  in  their 
nature:  For  any  go  to  program  G  there  are  5  and  T  such  that  G  ss.,  H  where  H  is  the 
Engeler  normal  form;  however,  the  values  of  S  and  T  must  grow  with  the  size  of  the 
program  G . 

In  Section  6  the  relation  £s-r  is  placed  in  the  context  of  relations  used  in  previous 
studies  of  control  structure  simulation.  The  main  simulation  results  for  control  structures 
arc  then  developed,  giving  rise  to  the  hierarchy  of  control  structures  shown  in  Figure  I 
An  important  result  is  that  go  lo  programs  are  strictly  more  powerful  than  label  exit 
programs.  Since  the  class  of  label  exit  programs  includes  many  of  the  standard  constructs 
that  arc  allowed  in  "structured”  programs,  this  result  can  be  viewed  as  a  precise  sense  in 

1  When  wo  establish  results  of  this  form,  we  arc  asserting  that  there  is  a  minimal  rate  of  growth  for  f  as  a 
function  of  n,  In  the  sequel  we  will  consistently  abuse  out  notation  by  writing  fIS,  T I  2  got!  instead  of  the  less 
convenient  ft. Sin),  rom  -  gim  It  will  usualls  be  clear  from  context  when  V.  T  arc  to  be  considered  constants 
and  when  .V  and  7  are  parameterized 
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computed  goto  or  case 

i 

* 

goto  with  d  way  branching  (d>2) 

* 

label  exits 

I 

i 

do  forever 

i 

while 
Fio.  I 

which  there  is  a  lime-space  speedup  between  go  to  programs  and  “structured”  programs: 
There  are  go  to  programs  whose  only  “structured”  counterparts  explode  in  either  time  or 
space.  This  result  seems  to  make  precise  the  comments  of  Knuth  (9]  on  the  efficiency  of 
go  to  and  “structured”  programs. 

While  the  results  in  this  paper  are  motivated  by  our  interest  in  the  (lower  of  data  and 
control  structures,  they  may  have  interest  purely  as  combinatorial  results. 

2 .  The  Combinatorial  Representations 

A  directed  graph  G  is  an  ordered  pair  ( V,  £)  of  nodes  and  arcs.  If  there  is  an  arc  from  x  to 
y  and  an  arc  from  y  lox,  then  we  say  there  is  an  edge  between  x  and  y.  Moreover,  the  arcs 
shown  in  Figure  2(a)  are  represented  as  in  Figure  2(b).  A  path  from  x  to  y  is  defined  by 
any  sequence  of  arcs  from  x  =  x0  to  jr,  to  xt,  .  .  .  ,  to  x.  =  y.  We  define  a  metric 
dc(x,  y)  on  G  as  the  number  of  arcs  in  a  minimal  length  path  from  x  and  y . 

A  binary  tree2  is  a  finite  set  of  nodes  that  either  is  a  single  node  or  consists  of  a  root  x 
and  an  edge  between  x  and  the  root  of  each  of  two  binary  trees  called  the  left  and  right 
subtrees  of  the  root  (cf.  Knuth  [8]).  Note  that  nodes  in  a  binary  tree  are  connected  by 
edges  so  that  the  metric  is  symmetric.  If  G  is  a  binary  tree,  then  a  node  x  of  G  is  a  leaf  of 
G  if  x  has  no  sons. 

We  represent  both  control  structures  and  data  structures  by  directed  graphs.  In  the 
control  case,  the  nodes  of  a  graph  G  represent  executable  statements  and  the  arcs 
represent  possible  flow  of  control;  in  the  data  case,  the  nodes  of  the  graph  represent 
memory  locations  and  the  arcs  represent  logical  adjacencies  in  the  data  structure .  Thus  in 
either  case  what  is  to  be  modeled  is  the  “difficulty"  of  accessing  nodes:  The  complexity 
of  a  control  structure3  is  given  by  the  cost  of  accessing  and  sequencing  noncontrol 
instructions,  while  the  complexity  of  a  data  structure  is  determined  by  the  cost  of 
accessing  successive  data  elements.  Each  class  of  control  structure  or  data  structure  can 
be  studied  in  terms  of  restrictions  on  what  graphs  are  allowed  in  that  class. 

2.1 .  Data  Structures.  The  two  classes  of  data  structure  we  deal  with  are  arrays 
and  ancestor  trees. 

Arrays.  G,  denotes  the  data  structure  corresponding  to  an  n  x  n  array.  If  the  nodes 
of  G,  are  indexed  by  (»',/)  where  1  sisn  and  1  sjsn,  then  there  is  an  edge  between 
(i,j)  and  (/,/  +  l),  for  1  s  i  s  n,  1  s  j  <  n,  and  between  («',/)  and  (i  +  1,/),  for  1  s «  <  n. 
Is  j  s  n.  Thus  G„  is  “rook  connected.”  For  instance,  G>  is  illustrated  in  Figure  3. 

Ancestor  trees.  Ancestor  trees  are  binary  trees  with  an  additional  feature:  A  node  x 
of  an  ancestor  tree  may  be  connected  by  an  arc  to  any  of  its  ancestors.  For  example,  the 
graph  shown  in  Figure  4  is  an  ancestor  tree  because  y  is  both  an  ancestor  and  a  successor 

*  If  a  is  a  root  with  subtree  H  and  y  is  a  root  of  one  o(  the  subtrees  of  x,  then  y  is  *  son  of  x;  further,  x  is  an  an- 
cestor  of  each  node  in  H.  while  each  node  in  H  is  a  successnr  of  x . 

1  Some  care  must  he  exercised  m  viewing  control  structures  that  are  represented  in  this  way;  our  representa¬ 
tions  do  not  always  correspond  to  (temporal)  flow  of  control  and  are  not  to  be  looked  at  as  flowcharts  Ralhcr. 
what  is  being  modeled  is  the  potential  control  connectivity  of  an  underlying  algorithm  or  process 
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of  x.  Notice,  however,  that  unlike  metrics  on  binary  trees,  the  graph  metric  d,.  is  not 
necessarily  symmetric  on  ancestor  trees.  Ancestor  trees  include  linear  lists,  circular 
lists,  and  threaded  lists  [14]  as  special  cases. 

2.2.  Control  Structures.  We  consider  the  following  five  classes  of  control  struc¬ 
tures:  computed  go  to;  go  to  with  d- way  branching;  label  exit;  do  forever;  while.  In 
addition,  all  of  the  available  classes  have  access  to  a  sequential  flow  of  control  and  an 
alternative  (e.g.  if-then-else)  flow  of  control.  Since  the  constructions  described  below  do 
not  involve  schema  manipulation,  the  details  of  these  features  need  not  be  made  explicit . 
Indeed,  there  are  a  number  of  ways  to  represent  these  features  in  our  model,  and  our 
later  results  are  invariant  under  the  differing  representations.  We  now  present  the  class 
of  graphs  that  represent  programs  formed  from  each  of  the  five  control  structures. 

Computed  go  lo  programs,  go  to„  programs  are  programs  that  allow  arbitrary 
branching  between  statements.  For  instance,  we  allow  for  representations  of  the  con¬ 
struct  go  to  I  ( L . . E„),  which  branches  to  the  t'th  label  depending  on  the  value  of  i. 

Thus  this  class  of  programs  is  represented  by  the  entire  class  of  directed  graphs.  The 
construct  above  is  represented  by  the  graph  of  Figure  5.  a  node  with  n  arcs  leading  to 
nodes  labeled  by  L, . L„. 

go  to  programs  with  d-way  branching,  go  to<f  programs  are  programs  in  which  the 
amount  of  branching  that  is  possible  in  one  step  is  bounded  by  the  integer  d.  For 
example,  the  Fortran  construct  IF  <£)  L„  L,.  L3  falls  in  the  class  go  to3.  Programs  with  d- 
way  branching  are  represented  by  the  class  of  directed  graphs  with  a  maximum  out- 
degree  d.* 

Label  exit,  do  forever,  and  while  programs.  Label  exit,  do  forever,  and  while 
programs  are  defined  as  certain  classes  of  ancestor  trees.  In  order  to  define  these  classes, 
we  need  the  following  relations,  which  are  defined  for  any  ancestor  tree: 

x  — »  v  if  v  is  the  left  son  of  x. 

P  ' 

x  -j*  y  if  y  is  the  right  son  of  x. 

i-j>  yil  y  has  an  ancestor  pointer  from  x . 

•  The  oiil-degree  of  a  node  x  I*  |(y  :  d,fx.  v I  -  l)|. 
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We  view  x  j*  y  as  meaning  that  statement  x  can  “push"  into  a  substructure  with  first 
statement  y;x  -j»  y  as  meaning  that  statement  x  is  “sequentially”  followed  by  y ;  andx 
y  as  meaning  that  statement  x  can  “exit"  some  structure  and  return  to  statement  y. 

A  program  is  a  while  program  provided  it  is  an  ancestor  tree  that  satisfies:  y  x 

implies  3 y, . y*  such  that  x  -y*  y,  t*  y,  7*  -  •  •  -j*y*  =  y,  where  y  is  a  leaf  and  no y,  for 

i  <  k  has  an  ancestor  pointer  (see  Figure  6(a)).  The  last  restriction  reflects  the  fact  that 
only  the  last  statement  in  a  while  loop  is  allowed  to  exit  the  loop. 

A  program  is  a  do  forever  program  provided  it  is  an  ancestor  tree  that  satisfies:  y  -5*  x 

implies  3  . . .  =  y  such  that  x  ■$*  y,  -j*  y,  -j*  •  -j *  yk,  where  each  y,  can  have 

ancestor  pointers  only  to  x  (see  Figure  6(b)).  The  key  distinction  between  while 
programs  and  do  forever  programs  is  that  in  a  do  forever  program  all  statements  in  a  loop 
can  potentially  exit  immediately  out  of  the  looping  structure.  Clearly,  do  forever 
programs  correspond  to  the  fl,  (n  2  1)  structures  of  Bohm  and  iacopini  [2). 

Finally,  a  label  exit  program  is  any  program  that  is  also  an  ancestor  tree.  Essentially 
label  exit  programs  allow  any  jumping  out  of  substructures  as  long  as  the  return  is  always 
to  an  ancestor.  The  class  of  label  exit  programs  is  therefore  quite  extensive  and  includes 
many  types  of  so-called  “structured"  programs  (cf.  Peterson  et  al.  [15]).  For  example,  all 
label  exit  programs  arc  reducible  in  the  sense  of  |7);  moreover,  they  correspond  to 
programs  in  Engeler  normal  form  [6], 

Example.  The  following  program  contains  label  exit,  do  forever,  and  while  control 
structures;  its  representation  using  the  conventions  outlined  above  is  shown  in  Figure  7. 

L  S,; 

white  B,  do 
hefia  S,; 
do  forever 
fcef in  St.  nil  L; 

do  forever  befin  5,;  exit:  S,  cod: 

S. 

cod: 

St 

ead. 

S,;  □ 

3.  5,  T  Bounded  Embeddings 

The  following  definition  is  fundamental  to  what  follows.  Let  G  =  (V,  E)  and  G*  = 
(V* ,  E*)  be  directed  graphs  with  associated  metrics  dG  and  rfc. .  We  say  that  G*  can  simu¬ 
late  G  (or  G  can  be  embedded  into  G*)  with  space  constant  5  and  time  constant  T,  writ¬ 
ten  G  ss-r  G* ,  if  there  is  a  mapping  (called  an  embedding )  Q  :  V*  -»  V  U  {A}  of  the  nodes 
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of  G*  to  the  nodes  of  G  and  a  special  node  A.  so  that: 

(1)  Vv*  e  V  with  <t>(v ')  *  A. 

Vw  e  V  such  that  <fc(< l>(v*l.  h’)  <  *, 

3  w'  £  V"  such  that  <I>(h>*)  =  w  and  dGiv‘,  w*)  s  TdGi<P(v0),  w); 

(2)  Vc  €  V.  0  <  |  4»-*(v)  |  =  |  {v*  €  V*  :  <t»(v*)  =v)|sX. 


If  <J»  is  an  embedding  and  <t>fv* >  =  A.  then  we  refer  to  v*  as  a  bookkeeping  node.  If 
=  v  *■  A,  then  v*  is  said  to  be  a  copy  of  v.  If  5  =  1 .  we  often  write  instead  of  s,T. 

Condition  ( l )  states  that  when  G  and  G*  are  control  structures  (or  data  structures) 
simulation  involves  at  most  a  T-fold  increase  in  the  cost  of  statement  sequencing  (or 
data  element  accessing),  i.e.  the  embedding  induces  at  most  a  T-fold  increase  in  path 
length .  Condition  (2)  states  that  there  are  at  most  S  copies  of  any  vEVin  G*.  Note  that 
although  G  sv  r  G*  may  hold  between  data  structures  G  and  G*  when  5  >  1 .  it  is 
unlikely  that  such  a  simulation  would  be  of  value  (e.g.  if  an  array  is  being  stored  as  a  list 
structure  with  multiple  copies  of  array  elements,  then  selective  updating  of  the  array  may 
involve  multiple  updating  of  list  nodes).  For  control  structures,  however,  simulations 
with  S  >  1  are  frequently  used  and  are  quite  natural;  this  is  sometimes  called  node 
splitting. 

Example.  Consider  the  flow  diagrams  shown  in  Figure  8.  Figure  8(b)  is  the  result  of 
applying  a  standard  ''restructuring"  algorithm  1 1 )  to  Figure  8(a)  to  remove  the  multiple 
exit  loop  Xj,  x«,  x5.  Viewing  both  diagrams  as  directed  graphs,  the  graph  in  Figure  8(b)  is 
a  2,2  simulation  of  Figure  8(a)  by  defining  as  follows: 


<!><xf)  =  x,. 

<t>(x,*l  =  x2. 

4>(xJ)  =  <J>(x?)  =  (x?0)  =  A. 

4>tx;i  =  =  x3, 

<i>u.?)  =  =  x4. 


<t»(xB*)  =  Xj, 

•Hr?.)  =  x*. 

<t>(xfa)  =  =  xT, 

4><xf«>  =  4><x  M  *  X*. 
❖ixTsI  =  X,. 


□ 


4.  Data  Structure  Embeddings 

In  this  section  we  present  our  main  result  for  data  structures,  settling  negatively  the 
question  whether  arrays  can  be  stored  as  arbitrary  lists  with  linear  bounds  on  proximity 
and  determining  a  nontrivial  lower  bound  on  the  growth  rale  of  T  as  a  lunetion  of  n  for 
an  nxn  array.  This  result  generalizes  a  result  ol  Rosenberg  |lb|  showing  that  arrays 
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'  Fig  8 


'cannot  be  stored  in  linear  memory  with  only  bounded  loss  of  proximity.  But  since  the 
arguments  are  fundamentally  different,  it  is  interesting  to  compare  the  two  proofs. 
Recall  that  Rosenberg's  arguments  are  essentially  "volumetric":  The  number  of  neigh¬ 
bors  within  distance  n  of  a  node  in  an  array  can  be  quadratic  in  n .  while  a  node  in  a  linear 
list  can  have  at  most  2 n  such  neighbors.  A  volumetric  argument  then  demonstrates  that 
arrays  cannot  be  stored  in  a  system  with  such  linear  neighborhood  structure  with  only 
bounded  loss  of  proximity.  In  contrast,  such  methods  do  not  seem  to  apply  to  our 
problems;  e.g.  a  node  in  a  binary  tree  can  have  more  than  2"  neighbors  within  distance  n. 

To  obtain  our  result  we  need  a  series  of  lemmas.  Let  G  =  (K.  £)  be  a  directed  graph 
with  associated  metric  dG  and  suppose  A  C  V.  We  define  the  boundary  of  A  as  follows: 

d(A  I  =  {>’  e  A  :  3  x  €  A  such  that  dG{x.  v)  =  1}. 

In  other  words.  <)(A)  is  the  set  of  nodes  in  A  reachable  from  some  node  not  in  A  bv  an  arc 
of  G 

Lemma  4.1.  Lei  G,  =  (V„,  E)  be  an  n*n  array  and  suppose  that  A  C  V,  is  such  that 
|  A  |  s  Jn2.  Then  \  A  |  <  2  |  c)(A)  |2. 

Proof.  We  assume  |  A  |  >  0.  since  otherwise  the  lemma  is  trivially  true,  and  let  A,. 

....  A„  be  the  columns  of  A  ;  that  is.  if  {(1, »).  (2.  i) . (n.  i)}  is  the  tth  column  of  G„, 

then  A ,  is  that  subset  of  the  column  that  is  included  in  A .  Let  k  be  the  number  of  columns 
A,  such  that  |  A,  |  <  n.  and  let  /  s  A  be  the  number  of  columns  At  with  0  <  |  A,  |  <  n. 
Since  |  A  \  s  Jn2.  it  follows  that  (n  -  k)n  s  }n2  and  hence 

k  s  \n.  (1) 

Notice  that  if  0  <  |  A,  |  <  n.  then  at  least  one  node  in  A,  is  adjacent  to  a  node  not  in  A 
and  thus  contributes  at  least  one  node  to  MA);  therefore 

|  <»(A)  |  a  /. 

Suppose  that |  Ate  |  =  0  for  some  i9.  I  s  i0  s  n.  We  then  claim 
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|  <9(A)  |  2  max  |  A,  j.  (3) 

I 

To  show  this,  let  A,  be  maximal  in  si2e  and  assume  i0  <  j.  the  case  j  <  i„  being  handled 
symmetrically  Select  an\  row  r  of  G„  such  that  (r,  j)  E  A,-.  Now,  Ir,  /„)  £  A  by 

assumption,  but  some  one  or  more  of (r, t0  +  1) . (r , /)  is  in  A  .  Therefore  each  row  r 

of  G .  for  which  (r./l  E  A ,  contributes  at  least  one  node  to  d(A),  which  establishes  (3). 
To  complete  the  proof  of  the  lemma  we  consider  two  cases. 

I.  No  A,  is  empty.  In  this  case.  /  =  k,  and  by  combining  (1 )  and  (2), 

2  |  d(A ) |*  2  2P  =  2k*  2  in1  2  \A  j. 

II.  Some  A,  is  empty.  Let  c . .  denote  the  cardinalities  of  the  nonempty  columns 

A,.  If  some  cp  =  n,  then  the  result  follows  directly  from  (3).  If  not,  then  m  =  /  andci  + 

•  +  cm  =  |  A  |,  so  that  max,  |  A,  |  2  |  A  | //.  By  (2)  and  (3)  it  follows  that  2  |  <J(A)  |  2  / 
+  |  A  |//.  The  lemma  is  now  immediate  by  calculation.  □ 

Lemma  4.2.  Let  G,  =  (V,,  E )  and  suppose  x,  y  E  V,;  then  dGifx,  y)  s  2n. 

Proof.  This  is  an  elementary  property  of  arrays.  □ 

Lemmas  4.1  and  4.2  and  the  fact  that  |  V„  |  =  /t2  summarize  the  basic  properties  of 
arrays  that  will  be  used  in  the  proof  of  our  main  result. 

Lemma  4.3.  Let  H  =  (V,  E)  be  an  ancestor  tree  and  let  H„  =  (V0,  E0)  be  a  subtree  of  H. 
If  x  E  V0 and  y  E  V  -  V„,  then  dH(y.x)is  greater  than  or  equal  to  the  depth  of  x  in  H0. 
Proof.  Since  y  £  V'o,  any  path  from  y  to  x  must  pass  through  the  root  of  H„.  □ 

Lemma  4.4.  Let  H*  =  (V* ,  £')  he  an  ancestor  tree:  let  HI  =  <VJ ,  £{)  fee  a  subtree  of 
H';  and  let  A  =  -  {A}.  If  G.  sr  H'  and  \  A  |  £  in*,  then  T  2  J(/og |  A  |  -  1);  in 

other  words ,  |  A  |  S  22r' ' . 

Proof.  Assume  that  |  MA  I  |  >  2r.  Since  the  root  of  WJ  has  at  most  2’  descendants  of 
depth  less  than  7"  +  1 ,  there  is  a  rode  t*  E  Vo  of  depth  greater  than  or  equal  to  T  +  1  in 
H J  such  that  4>U*I  £  fl(A).  Since  Ou*»  E  ri(A).  there  is  ay  6  V,  -  A  withdc.(y,  4>U*)I 
<  I .  Now  there  exists  ay*  such  that  <t>( v * )  =  y  and  dT.  (y*.x*l  £  T,  by  the  definition  of 
£r.  Since  y  £  A  it  follows  thaty*  £  VJ.  But  by  Lemma  4.3,dH.(y*,  Jt*)  2  T  +  1,  which 
is  a  contradiction.  Therefore,  |  dIA )  |  <  2’  and  bv  Lemma  4.1  |  A  \  s  |  d{A)  |2  £ 
2,r*'.  □ 

Theorem  4.5.  Let  H‘  =  ( V* ,  E*)be  an  ancestor  tree .  If  G„  2  r  W* ,  then  T  2  J  log  n  -  J. 
Proof.  Assume  G,  £7-  W*  and  for  any  subtree  H,  =  (V*,  £,*)  of  H *  let  A,  =  <I»  ( Vf )  - 
{A}.  Let  Hf  and  Hi  be  subtrees  of  some  node  in  W* .  Either  |  A,  |  £  }n2  or  |  A2  |  s  JnJ, 
since  <!>  is  1-1 .  Using  this  fact,  we  may  assume  that  H*  is  of  the  form  shown  in  Figure  9, 
where  |  A,  [  £  for  1  £i  £  k.  (We  have  suppressed  explicit  representation  of  ancestral 
links.)  Without  loss  of  generality  we  assume  always  that  the  “smaller”  subtree  is  on  the 
right.  By  Lemma  4.4,  |  A,  |  £  2*r+l  for  all  i. 

Let  i  be  the  smallest  integer  such  that  |  A,  |  *  0.  and  let  j  be  the  largest  such  integer. 
Then  V;_(  |  A,  |  £  (/  -  /  +  1)2”*'.  Since  |  V„  |  =  n2. 
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(j  -  i  +  l)2*r*‘  2  n*.  (1) 

Now  let**  e  V*  and y*  6  V*.  Then  by  Lemma  4.3 ,  dw.(y*,  **)  a;  -  i. 

On  the  other  hand,  by  Lemma  4.2.  dc,{Q{y*),  4>U*))  —  2n;  hence,  since  G,  H *, 

d*-  (>'*•**>  s  2n7\  Thus 


j  -  i  s  2 nT.  (2) 

Combining  (1 )  and  (2).  we  have  2/iT  +  1  a  n*/ 2*T+I.  It  follows  that  T  a  i  log  n  -  i.  □ 

5 .  Main  Theorem 

Observe  that  the  S  =  1  hypothesis  was  used  at  several  key  points  in  the  proof  of  Theorem 
4.5.  Since  this  restriction  is  unrealistic  in  dealing  with  control  structures,  we  now  remove 
it  by  generalizing  the  previous  result. 

Theorem  5.1 .  Let  H*  =  (V",  E')be  an  ancestor  tree  and  let  G„  £S.TH*  where  G,is  an 
n  x  n  array.  Then  T  +  log  S  2  log  n  -  log  8v2 

Proof.  Let  <!>  be  the  embedding  function,  and  define  a  new  function  ¥  mapping 
subsets  of  V*  to  subsets  of  V  by  WA*)  =  <t>(/4*)  -  {A}.  In  other  words,  'P(A*i  contains 
those  nodes  of  V  for  which  copies  exist  in  A*. 

As  in  Theorem  4.5.  we  decompose  H'  as  follows.  Let  or*  be  the  root  of  W*  and  write 
//*  as  in  Figure  10(a).  where  we  may  assume  |  |  £  |  ¥(/?,*)  |  without  loss  of 

generality  Clearly,  this  process  can  be  repeated,  letting  Jt*+1  denote  the  root  of  R ,*  and 
expanding  R *  at  each  stage  of  the  construction.  Thus,  H‘  can  be  written  as  in  Figure 
10(b).  where  |  'P(L*)  |  s  |  ♦(Rf)  |  for  1  £  «  s  k.  Notice  that  we  have  ignored  all 
ancestral  links  in  this  construction.  Indeed,  we  assume  that  all  such  links  exist  but 
suppress  explicit  reference  to  them. 

Let  H?  =  (V;,  E* i  denote  the  subtree  of  Figure  10(c).  We  say  that  H'  is  small  if 
|  ¥(  V*)  |  £  in’;  otherwise  W,*  is  large.  (The  notion  of  smallness  is  motivated  by  the  key 
to  the  argument  in  Theorem  4.5.)  Let 

Dk  =  U  W). 

I  SIS* 

H7  small 

In  other  words,  D*  is  the  set  of  nodes  in  V  of  which  copies  exist  in  some  small  H* . 
Lemma  5  2.  For  some  p,  in*  ss  |  Dp  |  <  in*. 

Proof.  By  convention,  |  D„  |  =  0.  If  |  Dr-,  |  <  in* and  |  D,  |  2  In*,  then  Dp  =  Dp- , 
U  'P(V;),  where  Hi  is  small,  so  that 

I  D,  |  s  I  Dp-,  I  +  I  wv;)  |  <  in*  +  in*  =  in*. 

Thus  we  need  show  only  that  |  Dp  |  2  in*  for  some  p. 

Since  |  ♦<£.,•)  \  s  |  'V(Rl)  |  and  M'(V)  =  ¥(*.,*)  U  ♦(*,*)  U  V  (/?,*),  it  follows  that  n* 
=  |  'P(V)  |  S  I  m; I  |  +  1  +  |  ♦<«,*)  I  £  2  j  ¥(/?,*)  I  +  1;  hence  |  |  2  *(n*  -  1)  2 

in*.  We  can  obviously  choose  p  so  large  that  |  'P(/?;)  |  =  0.  We  claim  that  the  associated 
Dp  is  large . 

Let  i  be  the  largest  integer  such  that  |  V(R?)  |  2  in’.  Since  |  ♦(/?*)  |  2  in’,  such  an  / 
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must  exist.  Then  |  |  <  for /  <  /  £  p.  and  since  ViV?)  =  '•'(*•,*>  U  V(xJ),  be¬ 

have 

I  mvj\  |  s  1  +  |  ♦</.*)  |  s  1  +  |  wr;)  I  <  i  +  W- 

Thus  H*  is  small  for  i  <  j  £  p-  But  this  implies  'i'(R’)  Q  U,<w  Vl(V*)  Q  L>p  By  our 
choice  of  i.  however,  we  conclude  that  |  0„  |  a  |  ¥(/?,*)  |  z  in1,  establishing  our 
claim.  □ 

We  now  introduce  a  variant  of  the  concept  of  boundary.  If  A  is  a  set  of  nodes  of  G„. 
then  the  coboundary  of  A  is  defined  by 

3(A)  ■  {y  $  A:  there  exists x  G  A  such  that  dG ,  (x.  y)  =  1} 

=  3(V„  -  A). 

In  other  words,  3(A)  is  the  set  of  nodes  not  in  A  reachable  from  some  node  in  A  in  one 
step.  The  proof  of  the  following  result  is  similar  to  that  of  Lemma  4.1  and  is  omitted. 
Lemma  5.3.  Let  A  be  a  set  of  nodes  of  G.with  \  A  )  £  in’.  Then  |  A  \  £  2  |5(A)  |*. 
Let  k  satisfy  Lemma  5 .2 .  By  Lemma  5.3,  j  5(0*)  \  z  \  0*  \n/yj2  z  n/2J2.  Now  let  /  = 

|  \H:  :  «*  is  large}  |,  the  number  of  large  subtrees.  Since  at  most  S  copies  of  any  node  in 
G,  appear  in  H * , 

/n3/4  £  £  |  V<V'?)  |  s  Sn1. 

Isis* 

H.*  Urge 

Hence  /  £  45. 

In  order  to  complete  the  proof  we  proceed  as  follows.  We  have  already  shown  that 
|  3(0*)  I  z  n/2V2;  we  show  next  that  this  implies  that  there  are  too  many  paths  into  the 
large  trees  H*  from  the  small  trees  for  5  and  T  to  be  bounded 
Let 

Qt  -  {v*  €  V’  :  H’  is  large  and  there  exist  W*  small  andx*  6  V*  such  thatd„.(x*,  v*)  £ 
T }. 

In  other  words,  QT  is  the  set  of  nodes  in  large  subtrees  H*  that  can  be  reached  from  some 
node  in  some  small  subtree  Hf  in  at  most  T  steps.  We  define  a  one-to-one  mapping# 
from  3(D»)  into  Qj  as  follows.  Select  some  y  G  5 (Dk).  Theny  C  Dk  and,  for  some  x  G  Dk, 
dc,(Jr,y)  £  L  Let  jt*  be  a  copy  of*  in  some  small  Hi .  Such  a  copy  exists  by  the  definition 
of  Dk.  Since  G„„s.r  //*,  there  is  a  copyy*  ofy  such  that  dH.(x’,y’)  £  T.  Now  y*  is  not  in 
any  small  Hi  since  y  G  Dk.  Thus  we  can  define  giy)  =y*,and#  is  indeed  a  mapping  from 
MDk)  to  Qt-  In  order  to  see  that  g  is  one-one,  we  note  that  for  any  y  G  3(0*),  ^(y)  = 
♦(y*)  =  y;  hence  g  is  one-one,  so  that  |  Qt  I  a  I  3(0*)  |. 

Thus  we  have,  on  the  one  hand,  that  |  QT  \  z  \  3(0*)  |  z  n/2y/2  and,  on  the  other  hand, 
that 

|  Qt  |  £  |  {Hi  :  Hi  is  large}  |  •  |  (v*  :  v*  G  large  Hi  within  depth  T  of  the  root  of  Hi)  \ 

£  /•  2T  £  45 ■  2r. 

Combining  the  upper  and  lower  bounds  on  |  Qr  |.  we  deduce  that  T  +  log  5  s  logn  -  log 
8,/2.  O 

As  an  application  of  Theorem  5.1,  we  present  the  following  result.  Informally  a 
flowchart  is  said  to  be  in  Engeler  normal  form  if  it  is  represented  by  a  tree  augmented  by 
pointers  from  nodes  to  ancestors,  or  nodes  at  an  earlier  level  but  along  the  same  branch. 
More  precisely,  a  go  to  program  G  has  an  5.  T  Engeler  normal  form  if  G  £.v.r  H  for  some 
ancestor  tree  H. 

Corollary  5.4.  ( 1)  //  G,  has  an  5,  T  Engeler  normal  form  and  T  is  fixed ,  then  5  a 
c-n.  (2)  If  G„  has  an  5,  T  Engeler  normal  form  and  5  is  fixed,  then  T  z  c-log  n. 

Thus  in  the  worst  case,  cither  time  or  space  must  be  unbounded  in  the  construction  of 
Engeler  normal  forms. 
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6.  Control  Structures 

In  this  section  we  establish  our  main  results  for  control  structures,  using  the  relation  s-  N , 
(see  Figure  1 ).  For  classes  X  and  V  of  control  structures,  i  e .  classes  of  graph  representa¬ 
tions  of  programs  constructed  using  only  control  structures  from  the  indicated  restricted 
class  of  control  structures,  we  say  that  X  is  more  powerful  than  V  when  there  exist 
constants  S'.  T'  such  that  (l )  for  all  H  E  Y  there  exist  G  €  X  such  that  H  sv  r.  G.  but  for 
no  constants  S,  T  is  it  true  that  (2)  for  all  G  6  X  there  exists  H  £  Y  such  that  G  £.s.r  H 

Since  for  the  hierarchy  of  Figure  1  if  A-  is  more  powerful  than  V.  then  the  control 
structures  in  Y  are  restrictions  of  the  control  structures  in  X.  condition  (1)  is  trivially 
satisfied  with  S'  -  T'  =  1.  It  is.  of  course,  the  results  that  establish  condition  (2)  that 
have  the  greatest  novelty. 

To  place  our  results  in  historical  perspective,  we  follow  Ledgard  (12)  in  distinguishing 
the  following  extremes  in  simulations  among  control  structures: 

(1)  G  is  functionally  simulated  by  H  (written  G  sf  H)  if,  under  identical  interpreta¬ 
tions.  G  and  H  compute  the  same  function. 

(2)  G  is  very  strongly  simulated  by  H  (written  G  s„  H)  if  G  H  and  if  <t>  is  an 
embedding  inducing  £,  ,,  then  the  domain  of  <t>  and  the  range  of  <t»  are  identical  sets. 

In  (2 1  it  is  shown  that  for  each  go  to  program  G  there  exists  a  while  program  H  such 
that  G  £/  H.  while  in  1 1 0)  it  is  show n  that  for  some  go  to  program  G  there  does  not  exist 
a  while  program  H  such  that  G  s„  H.  Several  other  notions  of  simulation  intermediate 
to  s,  and  have  also  been  used  to  study  the  relative  power  of  classes  of  control 
structures  (I ,  .1.  11.  IS) 

The  connection  between  our  relation  sS  T  and  these  relations  is: 

(1 )  -s.t  is  weaker  than  s„,  since  we  allow  both  space  and  time  to  increase  and  do  not 
require  <I>  to  have  identical  range  and  domain; 

(2)  ts.v.r  is  stronger  than  sf,  since  we  require  that  paths  be  preserved  in  a  weak  sense; 

(3)  s.v.r  deals  only  with  combinatorial  aspecls  of  program  structure,  and  thus  we 
make  no  assumptions  about  adding  program  variables  or  extra  predicates  (as  were  made 
for  example  in  1 1 .  2.  II)). 

We  thus  claim  that  the  hierarchy  theorems  presented  in  this  section  span  the  relations 
used  in  previous  studies.  For  the  remainder  of  this  section  we  adopt  the  notation  X  *  Y 
to  indicate  that  Y  C  X  but  the  graphs  in  X  are  not  uniformly  simulated  by  the  grapl.s  in 
Y.  i.e.  X  is  more  powerful  than  V. 

We  will  make  use  of  the  following  definitions.  For  any  directed  graph  G  let 
(/,  x)  =  |{y  :  dG  (y.  *1  -  01  Km  (I.  x>  -  |{v  :  dc  Ur.  y)  s  /)). 

Lemma  6.1.  Suppose  that  G  s.s.rG’  and  let  x  be  a  node  in  G.  Then  ( I )  for  am  cops 
x*  ofx.  iV'-,,  </,x)  £  JVgJ  lTI,x’):and  (2)  for  some  copy  x*  of x ,  N?*  (l.xls  S  .V'jtT.x'l 

The  proofs  of  both  (1 )  and  (2)  follow  easily  from  the  definition  of  s.s.r  and  arc  left  to 
the  reader. 

Theorem  6.2.  do  forever  i  while. 

Proof.  Let  5.  T  be  such  that  for  all  do  forever  programs  G.  there  is  a  while  program 
H  for  which  G  s.vr  H.  By  part  (2)  of  Lemma  6. 1 ,  for  any  node  x  in  G  there  exists  a  copy 
x*  of  x  such  that  A/fj,  (1.x)  £  S  N['„  (T.  x*).  But  since  W  is  a  while  program,  nodes  in  H 
have  at  most  one  ancestor  pointer  to  them,  so  that  Nft  (T.r'l  s  27 .  Thus  <V;-„  <  1 . « i  £  S2’ 
for  any  do  forever  program  G  and  any  node  x  in  G.  This  is  a  contradiction,  since  the 
number  of  ancestor  pointers  to  nodes  in  do  forever  programs  can  be  unbounded 

Theorem  6.3.  label  exit  i  do  forever. 

Proof.  Let  5,  T  be  such  that  for  any  label  exit  program  G  there  exists  a  do  forever 
program  H  such  that  G  £.vr  H.  Consider  the  label  exit  graph  G”1’  defined  as  follows 

(I)  V"’  =  (x, . x„},  (2)  x,  -»  x,.,  for  all  1  s  i  <  n.  (3)  x.— >x,  for  all  1  £  i  ^  n 

(Sec  Figure  1 1 .)  Then,  by  construction.  V£"'<L  x„)  =  n  -  1. 


15 


Space  and  Time  Hierarchies 


Fig  II 


Let  H"’  be  Che  corresponding  do  forever  program.  Then  a  node  x*  in  W*  has  at  most 
two  sons  and  one  ancestor,  so  that  N£,l’(T,  x *)  s  V.  Thus  by  part  (1)  of  Lemma  6.1 , 
n  -  1  =  NJIill,  x„)  s  V{U,(T,  x*)  £  3r,  where  x*  is  any  copy  of  x„  in  W,  a  contradic¬ 
tion.  □ 

Theorem  6.4.  For  i/s  2,  go  to*  £  label  exit 

Proof.  Notice  that  arrays  are  included  in  go  to/for  f  z  4.  Thus,  by  Theorem  S.I ,  we 
have  go  to,  £  label  exit  (/  z  4).  since  label  exit  programs  are  ancestor  trees.  To  complete 
the  proof  it  is  sufficient  to  note  that  for  anv  array  G„  there  is  some  H  in  go  to,  such  that 
Cm  □ 

Theorem  6.5  go  to*  £  go  to<i  for  all  d  z  1 . 

Proof.  Let  S.T  he  such  that  for  all  go  to*  programs  C.  there  exists  a  go  to*  program 
H  such  that  G  s,rH.  By  part  ( 1 )  of  Lemma  6.1 ,  for  all  nodes  x  in  G  there  is  a  copyx*  of 
x  in  H  such  that  /V"ai  ( T ,  x*)  z  (1.  x).  But  A/",,  (T,  x*)  £  dT .  since  the  out-degree  of 
any  vertex  in  H  is  at  most  d  This  is  a  contradiction,  since  in  go  to*  NSu,  II,  x)  is 
unbounded .  □ 

7 .  Conclusion 

The  methods  for  comparing  data  structures  and  control  structures  by  the  relation 
appear  to  be  quite  general,  and  there  arc  several  straightforward  extensions  of  the  data 
structure  embedding  results  that  recover  relationships  between  other  data  structures. 

In  the  case  of  control  structures,  the  conclusions  to  be  drawn  are  perhaps  more  widely 
varying  and  seem  to  give  direction  to  further  investigations.  The  observation  in  Corollary 
5.4  that  a  standard  schema  construction  is  inherently  inefficient  leads  us  to  question  the 
status  of  other  property  preserving  transformations.  We  also  believe  that  Theorem  6.5 
has  implications  for  the  often  quoted  "theoretical  foundations  of  structured  program¬ 
ming";  this  is  particularly  apparent  because  the  go  to  program  that  enters  the  proof  is 
itself  a  highly  structured  object.  Indeed,  a  £.«;t  embedding  into  a  label  exit  program  fails, 
not  because  G„  is  ill  structured,  but  rather  because  of  the  densely  hierarchical  nature  of 
the  control  flow .  We  thus  offer  programs  of  the  form  G,  as  "structured''  go  to  programs 
whose  structures  cannot  be  maintained  by  less  general  control  structures.  The  exact 
relationship  between  ancestor  trees  and  reducible  flow  graphs  |7)  is  still  unsettled,  but 
some  work  has  been  done  to  place  the  reducible  flow  graphs  in  the  hierarchy  of  Figure  1 
|5|.  More  recent  extensions  of  the  results  presented  here  give  techniques  for  uncovering 
total  space  and  time  simulation  trade-offs,  as  opposed  to  the  worst-case  analyses  of  this 
paper  |5|.  Finally,  the  extension  of  these  results  to  parallel  and  asynchronous  control 
structures  appears  to  be  possible  and  promises  to  yield  important  information  about  the 
relative  power  of  nonsequential  mechanisms. 
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Abstract:  Let  G  and  G*  be  programs  represented  by  directed  graphs.  We 


define  a  relation  £  _  between  G  and  G*  that  formalizes  the  notion  of  G* 

b  9  l 

simulating  G  with  S-fold  loss  of  space  efficiency  and  T-fold  loss  of  time 
efficiency,  and  prove  that  if  G  £  G*,  where  G  has  n  statements  and  G*  is 

O  9  1 

structured,  then  in  the  worst  case  T  +  log2log2  S  2  log2n  +  0(log2log2n) . 

Keywords  and  Phrases:  ancestor  tree,  complexity,  control  structure, 
directed  graph,  embedding 


CR  Categories:  A. 22,  4.34,  5.24,  5.32 


19 


1.  Introduction 

In  a  previous  paper  [13,  we  made  precise  some  Intuitive  observations 
concerning  the  efficiency  of  structured  programs  by  defining  a  combinatorial 
relation  that  corresponds  to  the  notion  of  uniform  simulation  between  programs. 
Informally,  we  say  that  a  program  G*  uniformly  simulates  a  program  G  if  G* 
carries  out  the  computation  of  G  (and  possibly  additional  computation  which 
might  be  regarded  as  ''bookkeeping")  in  such  a  way  that  the  space-time  efficiency 
of  G  is  degraded  by  a  factor  that  is  independent  of  the  size  of  G.  The  main 
results  of  [1]  indicate  that  the  non-existence  of  uniform  simulations  among  many 
well-known  classes  of  control  structures  is  due  to  the  combinatorial  aspects  of 
program  structure  and  is  not  at  all  related  to  such  details  of  program  organi¬ 
zation  as  choice  of  data  structures  or  limitations  on  the  form  of  Boolean 
expressions. 

Indeed,  the  main  result  of  [13  (Theorem  5.1)  provides  a  non-trivial  lower 
bound  on  the  loss  of  space-time  efficiency  in  any  structured  simulation  of  a 
goto  program.  This  short  note  extends  that  result,  improving  the  space-time 
inequality  of  [1,  Theorem  5.13  by  an  exponential.  Thus  we  now  show  that  there 
are  goto  programs  with  n  statements  such  that, for  any  structured  simulation, 

the  simulation  runs  at  least^ 

Cjlog2n 

times  as  slow  as  the  original  program, 

c3 

con 

the  simulation  has  at  least  2  statements. 

We  use  c^,  c^,  c^  to  denote  positive  constants. 


either: 

1) 

or 

2) 
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1. e.,  there  are  goto  programs  that  can  only  be  simulated  by  either  very  slow 
or  very  large  structured  programs. 

In  the  sequel,  we  will  concentrate  on  the  combinatorial  theorem  that 
achieves  these  bounds.  The  programming  language  significance  of  the  graphs 
and  relations  studied  here  is  discussed  extensively  in  [1]. 

2.  Preliminaries 

A  directed,  graph  G  is  an  ordered  pair  (V,E)  of  vertices  V  and  edges 

E  c  V  x  V.  A  path  in  G  is  an  ordered  sequence  of  vertices  connected  by  edges. 

For  vertices  x,y  e  V,  let  dg(x,y)  denote  the  length  of  a  minimum  length  path 

form  x  to  y.  If  no  such  path  exists,  then  d  (x,y)  *  •». 

G 

A  binary  tree  is  a  directed  graph  that  consists  of  either  a  single  vertex 
or  a  root  x  and  edges  between  x  and  the  root  of  each  of  two  binary  trees  called 
the  left  and  right  subtrees  of  x.  A  vertex  x  in  a  binary  tree  is  a  leaf  if  it 
has  no  sons.  If  H  *  (V,E)  is  a  binary  tree  with  root  r  e  V  and  leaf  l  c  V,  and 
P  =  (xj,...,xn)  is  a  direct  path  from  x^  »  r  to  xr  *  i,  then  P  is  called  a 
branch  of  H.  An  ancestor  tree  G  ■  (V,E)  is  a  directed  graph  with  the  following 
properties: 

1)  There  exists  a  subset  E  c  E  such  that  G  *  (V,E  )  Is  a  binary  tree; 

o  ~  o  o  J 

2)  If  (x,y)  c  E  -  E  ,  then  y  is  an  ancestor  of  x  in  G  . 

o  o 

Let  G  denote  the  n  *  n  rook-connected  array  of  vertices.  If  the  vertices 
n 

of  Gn  are  indexed  by  (i,j)  for  lsi.jsn,  then,  except  for  the  obvious  extremal 
conventions,  there  are  symmetric  edges  between  (i,j)  and  (i,J+l),  (i+l,j). 

For  any  directed  graph  G  ■  (V,E),  the  notion  of  boundary  makes  sense. 

Let  A  c  v.  Then  the  boundary  of  A  is  defined  as 

3(A)  ■  (yeV-A:  3x«A  such  that  (x,y)cE) 


Clearly,  3(A)  denotes  the  set  of  vertices  not  in  A  which  are  reachable  from  A  by 
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a  single  edge. 

By  a  simple  improvement  of  a  result  from  Cl],  we  have  the  following 
important  property  of  arrays: 

Lerrma  1:  (Boundary  Lemma)  Let  A  be  a  set  of  vertices  of  Gr  with  |A|  £  n2/2. 

Then 

2 1  A|  £  |  3  <A)  1 2 . 

3.  Graph  Embedding 

The  following  relation  was  defined  in  [1].  Let  G  ■  (V,E)  and  G*  »  (V*,E*) 
be  directed  graphs,  and  let  S,T  >  0.  Then  G  £g  ^  G*  if  there  is  a  partial 
function  (called  an  embedding)  #:V*  -*•  Vu{A),  of  the  nodes  of  G*  to  the  nodes  of 
G  and  a  special  node  A,  such  that 

1)  0  <  |4>  1(x)|  £  S  for  all  x  e  V; 

2)  For  all  x*  c  4>-*(V),  ii  d„.(#(x*),y)  <  «  for  some  y  e  V,  then  there  exists 
y*  e  $  ^(y)  such  that  dG^(x*,y*)  £  dG($(x*),y). 

If  #(v*)  ■  A,  then  we  refer  to  v*  as  a  bookkeeping  node •  If  4>(v*)  »  v*A, 
then  v*  is  said  to  be  a  copy  of  v.  Condition  (1)  states  that  there  are  at  most 
S  copies  of  any  veV  in  G*.  Condition  (2)  states  that  the  embedding  Induces  at 
most  a  T-fold  increase  in  path  length. 

Theorem  1:  Cl,  Theorem  5.2]  If  S(n),  T(n)  are  such  that  G^  -o,(n)  T(n)  G*  ^°r 
some  ancestor  tree  G*,  then 

T(n)  +  log2S(n)  *  log2n  +  Cj.  (1) 

The  right  hand  side  of  inequality  (1)  cannot  be  improved,  since  with  S(n)  =  1, 
the  construction  of  L2]  shows  that 
T(n)  -  0(log2n) 

+  The  notion  of  boundary  used  here  corresponds  to  the  coboundary  of  Cl]. 


V 
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is  achievable  for  any  n  vertex  graph.  Theorem  1,  however,  gives  only  a  linear 
bound  on  S(n),  and  it  has  been  conjectured  that  a  non-polynomial  lower  bound  on 
S(n)  exists.  In  the  next  section  we  obtain  such  a  bound. 

4.  Main  Theorem 

In  this  section,  we  obtain  the  following  improvement  of  Theorem  1: 

Theorem  2:  If  G*  is  an  ancestor  tree  and  Gr  -$(n)  T(n)  G*»  then 

T(n)  +  log2log2S(n)  2  log2n  -  0(log2log2n) . 

Proof:  For  notational  convenience,  let  us  systematically  confuse  a  graph  with 

its  set  of  vertices,  so  that  "x  e  G"  and  "x  e  V"  mean  the  same  thing  if  G  *  (V,E). 

We  assume  G  <  _  G*  via  an  embedding  $.  For  any  A*  £  G*,  we  use  $(A*)  to 

n  S,T 

denote  the  set  of  x  €  G  which  are  ^-images  of  some  x*  e  A*.  Henceforth,  we 

n 

assume  that  G*  is  a  binary  tree ;  it  will  be  obvious  as  we  progress  that  if  G* 
contains  ancestor  edges,  then  the  proof  is  completely  unaffected. 

Let  P  »  (x*,...,  *)  be  a  path  of  G*.  Then  P  is  an  admissible  path  if  it 
is  constructed  as  follows:  For  each  x*  (isisk),  let  L*  denote  the  subtree  of 
x*  containing  x*+1«and  let  R*  denote  the  other  subtree  of  x*;  then  either 

a)  *(R*)  2:  4>(LJ) 
or 

b)  *(R*)  *  n2/4. 

Note  that  the  definition  of  admissible  path  is  more  general  than  that  used  in  [1J. 
Indeed,  it  is  by  proving  the  existence  of  many  such  admissible  paths  that  we  obtain 
our  result. 

We  fix  an  arbitrary  admissible  path  P  ■  (x*,...,x£)  and  define  for  i  *  l,...,k 
the  subtree  H*  -  L*  u  {x*}.  We  shall  say  that  H*  is  email  if  l*(H*)|  s  n2/4; 
otherwise  H*  is  said  to  be  large.  Let 
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u->i  *. ,'.:v 


♦(H*> ; 


lsisj 


Is  small 


in  particular,  D,  is  the  set  of  vertices  in  G  which  have  copies  in  some  small  H*. 

K.  G  1 

Lemma  3:  For  some  j , 


iv 


Proof:  We  need  only  show  that  there  exists  an  integer  j  such  that  |D^ |  2  n2/4, 
since  if  j  is  the  least  such  integer,  then  (assuming  ID^I  *  0) 

n2  ,  n2 


IDjI  *  IDj.jI  +  l*(H*)|  <  +  n2/2 


We  claim  that  |«>(R>p|  2  n2/4.  For  suppose  otherwise,  whence  |4>(L*)|  s  |4’(R*)I 
by  the  definition  of  an  admissible  path.  Now 


4>(G*)  -  4>(H*)  u  4>(R*), 


so  that 


and  thus 


n2  -  |*(G*)|  5  l*(L*)|  +  1  +  |*(R*)|  S  2|*(R*)|  +  1, 


l*(RJ)l  *  n2/4. 

Let  j  be  such  that  |$(R*)1  «  0,  and  let  i  be  the  largest  integer  such  that 
[4>(R*)|  2  n2/4.  Then 

|4>(R*)|  <  n2/4,  for  l  -  i+l,...,j. 


Hence, 


I  ♦  (H*)  I  <  1  +  1 4><L*)  |  s  1  +  |*(R*)|  <  1  +  n2/4  for  all  l  -  i+l,...,n. 


But  then  each  such  H*  is  small,  and  therefore 


♦  <RJ)  £  u  *(HJ)  £  D j , 

iSiSj 


But  by  the  definition  of  1,  |Dj|  2  n2/4.  □ 
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Letting  k  satisfy  Lemma  3,  we  find  that  satisfies  the  hypothesis  of  the 
Boundary  Lemma,  so  that 

I  3  (Dfc)  I  *  /ZlDjJ1^2  2 

Lemma  4:  If  ip  is  the  number  of  large  trees  H*  along  an  admissible  path  P,  then 

ik  *  lp 2T- 

Proof:  Let 

Qt  ■  {v*  e  H*,  large:  for  some  small  H*  and  x*  c  H*,  dg(x*,v*)  <.  T}. 

i.e.,  Qt  is  the  set  of  vertices  in  large  H*  which  are  reachable  from  some  node  in 
a  small  H*  by  a  path  of  length  at  most  T.  We  show  that  |3(Dk>|  i  I QT I  by  defining 
an  injection  g  :  3(Dk)  •+  QT*  For  y  c  3(Dk>,  choose  some  x  c  adjacent  to  y. 

Let  x*  be  a  copy  of  x  in  a  small  H*,  let  y*  be  a  copy  of  y  such  that  dG^(x*,y*)  s  T, 
and  set  g(y)  -  y*.  Since  #g(y)  -  #(y*)  -  y,  g  is  one-one.  Thus,  from  (2), 

IQtI  *  |3(Dfc)|  * 


|QtI  s  |{H*  :  H*  large}) 


•  | {v*  :  v*  €  H*,  large;  v*  within  distance  T  of  root  of  H*}| 


*  ap  •  r 


,T+l/2 


To  complete  the  proof,  we  now  show  that  there  are  at  least  2 


admissible 


paths.  Since  each  admissible  path  corresponds  to  a  distinct  leaf  of  G*  and 


G  s  G*,  we  have 
n  o,i 


n  2~T 

7?  S  | *_1 (V) )  S  SJV)  -  Sn2 


and  the  result  follows. 


+  Without  loss  of  generality,  we  assume  that  no  leaf  of  G*  is  a  bookkeeping  node. 


~xni.ii  ii  -x 

Lemma  5:  There  exist  at  least  2  admissible  paths,  where  “  7f  *  ^  ' 

Proof:  We  prove  the  result  by  showing  that  at  least  independent  binary 

choices  must  be  made  to  construct  an  arbitrary  admissible  path.  Consider  a 

partial  admissible  path  . x^  (i.e.,  the  initial  segment  of  an  admissible 

path).  If  only  one  subtree  of  x^  is  large,  then  the  admissible  path  can  only 
be  extended  down  that  subtree.  However,  if  both  subtrees  are  large,  then  the 

4 

admissible  path  can  be  extended  down  either  subtree  without  violating  the 
condition  (a-b).  By  Lemma  4,  there  are  at  least  A large  subtrees  along  every 
admissible  path,  and,  for  each  such  subtree,  there  is  a  node  in  the  admissible 
path  with  two  large  subtrees.  OD 

By  using  the  modeling  strategy  detailed  in  [1],  we  obtain  the  following: 

Corollary :  For  each  n  there  is  an  n  statement  goto  program  Q  such  that  for  any 
structured,  simulation  of  Q  either 

1)  the  simulating  program  is  slower  than  Q  by  a  factor  of  c^  log  n,  or 

or  c2nC3 

2)  the  simulating  program  is  larger  than  Q  by  a  factor  of  2 

An  interesting  interpretation  of  this  result  as  a  space-time  tradeoff  is 
shown  in  Figure  1,  which  illustrates,  for  fixed  n  >  0, 


S(T,n)  2  2 


n/2 


For  any  fixed  value  K  S  T  <  c^  log  n,  limiting  the  loss  of  time  efficiency  in 
the  simulating  program,  the  shaded  region  of  Figure  1  shows  the  only  values  of 
S,T  which  are  achievable. 
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INTRODUCTION 

There  are  two  natural  methods  of  limiting  the  use  of  labels  in 
structured  programs:  bounding  the  number  of  labels  that  can  be  referenced 
by  a  single  statement  and  bounding  the  total  number  of  labels  which  can 
appear  in  a  program.  It  is  implicit  in  an  argument  of  [1]  in  the  former 
case  and  in  the  unbounded  analog  of  both  cases  that  a  genuine  limitation 
is  imposed  and  power  increases  with  the  number  of  labels.  We  show  here 
that,  in  the  latter  case,  programs  with  differing  bounded  numbers  of  labels 
are  provably  equivalent  in  the  precise  sense  of  [1,2],  From  [3]  it  is  known 
that  suitable  restrictions  on  the  notion  of  equivalence  result  in  provable 
differences  among  these  constructs;  these  restrictions,  however,  rely  on  the 
details  of  program  organization.  Hereafter,  we  deal  only  with  combinatorial 
arguments.  Further  motivation  for  the  combinatorial  properties  in  the 
sequel  may  be  found  in  [1]. 

PRELIMINARIES 

A  directed  graph  G  is  composed  of  a  set  of  vertices,  V(G),  and  arcs 
E(G)  c  V(G)  x  V(G).  The  arcs  (x,y)  and  (y,x)  together  form  an  edge  of  G. 

Arcs  and  edges  are  represented  by  directed  and  undirected  arrows,  respectively. 
A  path  from  x  to  y  is  a  sequence  of  arcs 

(x,  x1),  (x1,  x2),  ...  ,  (*n_2,  xn_i^>  (xn_i»  y) * 
and  such  a  path  is  said  to  be  of  length  n.  We  define  the  distance  metric 
d„(x,y)  to  be  the  minimum  of  the  lengths  of  all  paths  from  x  to  y. 


A 
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A  binary  tree  with  root  x  is  a  directed  graph  that  is  either  a  single 

vertex  x  or  contains  a  vertex  x  connected  by  edges  to  root(s)  of 

subtree(s)  G  ,  i  <  2.  Note  that  d  is  symmetric  on  binary  trees.  Let  G  be 

c 

a  binary  tree  with  root  x.  and  consider  the  path  (x_,  x, ),...,  (x  , ,  x  ), 

u  u  l  n-i  n 

where  x^  ^  x^  for  i  f  j.  We  define  the  following  relations  on  G: 

(1)  Xj  is  a  descendent  of  x,  (0  £  i  <  j  £  n) 

(2)  x^  is  an  ancestor  of  x..  (0  £  i  <  j  £  n) 

(3)  x.  is  the  father  of  x  (0  <  i  <  n),  and  we  write  x,  =  f  (x .  . ) 

1  1+1  —  1  0  1+1 

(4)  x^+^  is  a  son  of  x^  (0  £  i  <  n) 

(5)  x  is  leaf  of  G  if  f  (y)  ^  x  for  all  y  e  V(G) 


In  a  binary  tree  G,  the  subtree  with  root  x  is  denoted  by  G 


An  ancestor  tree  is  a  directed  graph  G  whose  arcs  may  be  partitioned  into  two 
maximal  subsets  E^,  E ^  such  that  (V(G),  E^)  is  a  binary  tree  and  if  (x,y)  t 
then  y  is  an  ancestor  of  x  in  the  binary  tree  (V(G),  E^) .  Thus,  in  an  ancestor 
tree  a  vertex  may  be  connected  by  an  arc  to  any  of  its  ancestors.  We  use  the 
special  notation  x  ■*  y,  if  (x,y)  e  E„.  The  following  terminology  is  suggested 
by  [l]:  if  y  4  x  then  x  is  a  label  and  y  is  an  exit, 

cl 

We  then  say  that  an  ancestor  tree,  G,  is  a  k-label  program  if  it  contains  at 
most  k-labels;  we  also  say  that  G  is  a  k-exit  program  if  it  contains  at  most 


k-exits . 
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SPACE-TIME  BOUNDED  SIMULATIONS 


The  following  definition  is  from  [1];  it  introduces  a  fundamental 

mechanism  for  comparing  programs.  Let  G,  G*  be  directed  graphs.  We  say 

that  G*  simulates  G  with  space  dilation  S  >  0  and  time  dilation  T  >  0, 

written  G  <_  _  G*  if  there  is  a  map  (called  an  embedding  of  G  in  G*) . 

0,1 


<&:  V (G*)  V (G)  u  {A},  A  $  V(G)  u  V(G*) 


such  that: 


(1)  Vue  V (G) 


0  <  |  <3>_1(u)  j  <  S  , 


(2)  V  v*  e  V(G*)  such  that  4>(v*)  f  A 


V  w  £  V(G)  such  that 


dG($>(v*),  w)  <  00 


3  w*  e  V(G*)  such  that  $(w*)  =  w,  and 


dG* (v*»  w*>  1T  ’  dG($(v*>,  w)  . 

If  I1  is  an  embedding  and  $(u*)  =  A,  then  u*  is  said  to  be  a  bookkeeping  vertex; 
on  the  other  hand,  if  $(u*)  =  u  f  A,  then  u*  is  said  to  be  a  copy  of  u. 

Clearly,  in  a  simulation  of  G  with  space  dilation  S,  no  vertex  of  G  can  have 
more  than  S  copies  in  the  preimage  of  the  embedding.  In  the  sequel,  we  will 
avoid  some  notational  unpleasantness  by  agreeing  that  A^,  X^,  ...  always  denote 
bookkeeping  vertices  and  that  u^*,  ...  u^*,  k  <_  S,  always  denote  copies 


t 
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It  is  known  that  for  every  S,  T  >  0,  there  is  an  ancestor  tree  which 
cannot  be  simulated  with  space  and  time  dilation  S  and  T  by  any  1-exit  program 
and  for  every  S,  T  there  is  a  1-exit  program  which  cannot  be  simulated  with 
space  and  time  dilation  S  and  T  by  any  1-label  program.  This  is  very  suggestive 
of  a  hierarchy  in  the  number  of  labels  for  the  relation,  among  ancestor 

trees. 

We  can  now  show  that  such  hierarchies  collapse.  That  is,  we  show  that  for 
every  k  >  1,  there  is  a  T  >  0  such  that  every  k-label  program  can  be  simulated 
by  some  1-label  program  with  space  dilation  S  =  1  and  time  dilation  T  =  T(k). 

We  begin  by  considering  a  general  embedding  procedure  which  dilates  space  by 
S(k)  >  1,  since  this  result  is  technically  easier. 


AN  OBSERVATION 


Let  G  be  a  binary  tree  with  root  x  and  consider  a  vertex  y  which  is  not 
the  father  of  two  vertices.  G  can  be  modified  by  viewing  y  as  the  root  and 
inverting  the  father-son  relationship  along  the  path  from  x  to  y.  Obviously, 
the  resulting  graph  is  still  a  binary  tree;  we  denote  this  tree  by  Gy. 


MAIN  SIMULATION  RESULT 


Let  H  be  an  ancestor  tree  and  choose  a  vertex  x  of  H  such  that  Among  the 

descendents  of  x  there  is  exactly  one  label  y.  Let  H'  be  obtained  from  H  by 

replacing  the  subtree  by  the  graph  shown  in  Figure  1. 

In  this  graph,  K  is  with  its  vertices  subscripted  by  "1",  L  is 
f  (v) 

(H  -  H  )  H  with  its  vertices  subscripted  by  "2",  and  M  is  H  with  its 
X  y  y 

vertices  subscripted  by  "2".  Thus  a  is  a  "second  copy"  of  fu(y).  In  addition, 

H 
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each  arc  u  +  y  or  u  +  x  Is  replaced  by  u_  -*■  X0,  u_  A_,  while  every 
3  3  L  3  Z  Z  3  Z 

u  -*•  v  with  v  4  x,y  is  replaced  by  u.  -*■  v  and  u_  -*■  v.  Then  we  have 

3  13  c  3 

H  £  2  3  which  may  be  proved  easily  by  a  case  analysis  of  the  possible 
arcs  in  H  and  their  copies  in  H'.  Note  further  that  if  x  is  not  a  label, 
then  H'  has  the  same  number  of  labels  as  H,  while  if  x  is  a  label,  the 
total  number  of  labels  is  decreased  by  one. 

We  now  prove  that  any  k-label  program  (k  £  2)  can  be  simulated  by  a 
(k-1) -label  program.  To  this  end,  let  H  be  a  k-label  program,  k  £  2. 

Two  cases  arise. 

Case  I.  Some  vertex  x  contains  exactly  one  label  in  each  of  its  subtrees. 

If  the  root  of  either  subtree  is  a  label,  no  transformation  is  required  for 
that  subtree.  In  all  other  cases,  replace  each  subtree  as  above  to  yield  a 
k-label  program  H',  where  H  £  _  ,  H'  and  in  H'  the  sons  of  x  are  both  labels. 
Let  the  sons  of  x  by  y,z  e  V(H’).  Now,  clearly  each  arc  u  -*■  y  or  u  •+  z  can 

3  3 

be  replaced  by  an  arc  u  +  x  at  the  expense  of  dilating  path  lengths  by  one 

3 

arc.  Hence,  this  transformation  has  the  effect  of  replacing  the  pair  of 
labels  y,z  by  a  single  label  x.  If  H"  is  the  result  of  such  a  transformation, 
then  since  dH„(x,y)  =  dH„  (x,z)  =  1,  we  have  H  £  ^  ^  H". 

Case  II.  Some  label  vertex  x  has  exactly  one  label  y  as  a  descendent.  The 
transformation  given  above  when  applied  to  the  subtree  rooted  at  x  yields  a 
k-1  label  program  H'  such  that  H  £  2  3  H'. 

Thus,  every  k-label  program  is  simulated  with  S=  2,  T  =  4  by  a  k-1  label 
program.  We  have  immediately  that  every  k-label  program  H  is  simulated  by 
some  1-label  program  G  with  S,  T  independent  of  |V(H)|;  more  specifically 


k-1  k-1 

S  =  2  ,  T  =  4  .  It  is  easily  seen  that  for  every  S,  T  there  is  a 

1-label  program  which  cannot  be  simulated  by  any  0-label  program  (i.e., 
by  a  binary  tree). 

AN  IMPROVEMENT 

The  vertex  duplication  in  the  construction  above  is  somewhat  artificial; 

it  is  used  only  to  keep  track  of  "end  points"  of  circuits,  and  we  might  try 

to  use  some  inherent  symmetry  in  the  problem  to  avoid  such  duplication.  In 

fact,  such  duplication  need  never  be  introduced.  That  is,  we  can  prove  that 

for  every  k-label  program  H,  (k  >_  2)  there  is  a  (k-l)-label  program  H'  such 

that  H  <  ,  .  H' . 

-  1,4 

Let  denote  the  regular  graph  on  n  vertices  with  degree  2,  shown  in 

Figure  2.  If  V(B  )  =  {a  ,  ...  ,  a  },  then  B  <  ,  _  G  ;  and  B  <  ■,  ,fc, ,  when 
n  1  n  n  —  1,3  0  n  —  1,4  1 

Gq  and  G^  are  as  shown  in  Figures  3(a)  and  3(b),  respectively. 

The  first  simulation  is  apparently  the  better  of  the  two,  but  in  fact  the 
simulation  H  ^  ^G^  is  the  one  which  is  to  be  preferred  for  the  simulation  to 
be  described. 

Using  this  transformation,  given  a  tree  H,  as  shown  in  Figure  4(a),  we 
embed  U  <_  .  ,H*  where  H*  is  as  in  Figure  4(b).  As  in  the  case  B  <  .  .G,  we 
now  have  a^  and  a^  relatively  close  to  each  other.  For  obvious  reasons,  we  call 
this  transformation  a  folding  of  H. 

Now  suppose  that  H  is  a  subtree  of  a  k-label  program  Ho(k  2)  that  a^  and 
an  arc  labels,  V(H^)  has  no  other  label,  and  none  of  a2»  ...  ,  an-1  is  a 

label.  We  then  form  H'  by  folding  H  and  replacing  each  u  +  a^  by  u  +  a^.  If 
no  such  subtree  H  of  exists,  then  no  label  is  related  to  any  other  as  either 


a  descendent  or  an  ancestor.  But  since  each  {x,y}  £  V(Hg)  share  a  common 
ancestor  (viz.  the  root  of  Hq)  choose  any  two  labels  x,y  and  let  z  be  their 
deepest  common  ancestor.  This  identifies  subtrees  of  the  form  H  with  a^  =  z 
and  £  {x,y}.  Fold  each  of  these  subtrees  and  replace  each  arc  u  x  or 

u  +  y  by  u  +  2.  Then,  if  the  resulting  ancestor  tree  is  H' ,  we  have 


Note  that  the  passage  from  a  k-label  program  to  a  1-label  program  still 
k-1 

requires  T  *  4  .  It  is  not  known  if  this  is  (asymptotically)  the  best 

possible. 
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Programmers  and  data  structure  designers  are 
often  forced  to  choose  between  alternative  structures. 
In  storing  these  structures,  preserving  logical 
adjacencies  or  “proximity"  is  usually  an  important 
consideration.  The  combinatorial  problem  of  storing 
arrays  as  various  kinds  of  list  structures  is  examined. 
Embeddings  of  graphs  are  used  to  model  the  loss  of 
proximity  involved  in  such  storage  schemes,  and  an 
elementary  proof  that  arrays  cannot  be  stored  as  linear 
lists  with  bounded  loss  of  proximity  is  presented. 
Average  loss  of  proximity  is  then  considered,  and  it  is 
shown  that  arrays  cannot  be  stored  as  linear  lists  with 
only  bounded  loss  of  average  proximity,  hut  can  be  so 
stored  in  binary  trees.  The  former  result  implies,  for 
instance,  that  row  major  order  is  an  asymptotically 
optimal  storage  strategy  for  arrays. 
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Efficient  algorithms  often  require  specific  data 
structures  on  which  to  operate.  In  many  of  the  algo¬ 
rithms  of  |1],  for  instance,  the  running  time  depends 
critically  on  the  number  of  probes  of  the  data  structure 
needed  to  access  a  single  data  item.  In  practical  com¬ 
putation.  however,  many  factors  interfere  with  optimal 
data  organization.  Among  the  most  important  of  these 
factors  is  the  locality  of  references  in  data  structures. 
Since  most  data  structures  are  arranged  linearly  in 
memory,  a  set  of  local  references  to  the  structures  can 
require  accessing  of  data  elements  widely  separated  in 
memory.  This  is  particularly  relevant  in  paging  envi¬ 
ronments  where  a  page  fault  may  occur  during  access 
of  elements  which  are  logically  adjacent  in  a  data 
structure. 

In  |6),  Rosenberg  examined  the  problem  of  storing 
arrays  as  a  linear  structure  with  bounded  loss  of 
proximity  between  data  elements  and  showed  that  any 
such  storage  scheme  must,  in  the  worst  case,  induce 
unbounded  loss  of  proximity.  In  previous  papers  (2. 
4],  we  considered  the  proximity-preserving  issue  in  a 
more  general  setting.  We  used  graphs  to  represent 
data  structures  as  described  in  |3):  vertices  represent 
data  elements  or  nodes,  and  edges  represent  logical 
adjacencies;  if  G  and  H  are  graphs,  we  write  G  s,  H 
if  G  can  be  “stored"  as  H  so  that  no  adjacent  nodes  of 
G  are  more  than  a  distance  T  apart  in  H .  Using  this 
model  we  were  able  to  show  that  if  G„  is  an  n  x  n 
array  and  H  is  any  of  a  very  general  type  of  structure 
(including  as  subcases  linear  lists,  binary  trees,  and 
threaded  lists  |5)).  then  G„  W  only  if 

T(n)  >  (1/3)  log*  -  2/3.  (II 

In  particular.  ( 1 )  shows  that  arrays  cannot  be  stored 
in  trees  or  lists  with  bounded  loss  of  proximity,  extend¬ 
ing  the  result  of  Rosenberg  (6). 

The  lower  bound  represented  by  inequality  (1)  is  a 
worst-case  result.  Since  algorithms  which  manipulate 
data  structures  tend  to  "look  at"  all  of  the  data,  it 
seems  natural  to  investigate  the  average  loss  of  prox¬ 
imity  involved  in  storing  arrays  as  various  list  struc¬ 
tures.  We  find  a  distinction  between  linear  memory 
and  arbitrary  lists:  arrays  can  be  stored  as  nonlinear 
lists  with  bounded  loss  of  average  proximity,  but  cannot 
be  so  stored  in  linear  memory. 


2.  Graphs  and  Embeddings 

By  a  graph  G  =  (V,  £)  we  mean  a  set  V  of  vertices 
and  a  set  £  of  unordered  pairs  of  vertices,  the  edges. 
Note  that  all  graphs  we  discuss  are  undirected,  i.e.  {x, 
y)  fz  E  iff{y,x)  £  E.  An  edge  between  two  vertices! 
and  y  is  represented 

x  y 
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A  path  of  length  n  between  * ,  y  E  V  exists  if  there  is  a 
sequence  of  edges  {x0tx,}.(*i,x,},  .  .  .  .{x.-i.x.)  with 
x0  =  x  and  =  y.  If  G  =  (V,  £)  and  x,  y  E  V,  then 
dc(x ,  y)  is  defined  to  be  the  length  of  a  minimal  length 
path  between*  andy .  A  graph  G  is  intended  to  model 
a  data  structure;  therefore  we  think  of  the  vertices  of 
G  as  nodes  of  the  structure  and  edges  of  G  as  repre¬ 
senting  logical  adjacencies  in  the  structure. 

We  shall  consider  three  classes  of  data  structures, 
defined  by  the  classes  of  graphs  which  represent  them. 

Lines.  A  line  L  is  a  graph  ((x,},^, . «„  E)  with 

{*i,  x ,(  £  £  iff  /  =  /  +  1 .  For  example, 

*i  x,  x3  x„ 

o - o - o - -  - - o 

represents  a  line  withn  nodes. 

Binary  Trees.  A  binary  tree  is  a  graph  which  is 
either  empty  or  consists  recursively  of  a  root  connected 
by  edges  to  the  roots  of  binary  trees  called  the  left  and 
right  subtrees  of  the  root.  We  assume  the  elementary 
properties  of  trees  as  described  in  ( 1 , 3). 

Arrays.  Ann  x  n  array  G,  is  a  graph ({*t.j}i,jEu . «>, 

E)  with{*,,j,  E  E  andjjtj.j,  jri+1-i}  €  £  for  all  1  £ 
i  £  n  and  1  £  j  <  n.  The  3x3  array  Gs  is  shown  in 
Figure  1 .  Modeling  n  x  n  arrays  as  graphs  of  the  form 
G«  does  not  recover  the  notion  of  "randomly”  access¬ 
ing  array  elements.  But.  as  noted  in  |4.  6],  algorithms 
which  operate  on  arrays  tend  to  access  array  elements 
"locally,”  e.g.  along  rows  and  columns  or,  as  in  the 
Strassen  matrix  multiplication  algorithm,  as  2  x  2 
subarrays  [1].  For  these  purposes  the  above  represen¬ 
tation  of  arrays  is  entirely  faithful  to  the  adjacency 
properties  of  arrays. 

Given  two  classes  of  graphs  X  and  Y  which  repre¬ 
sent  distinct  data  structures,  it  is  often  natural  to  ask 
whether  or  not  each  G  E  X  can  be  “represented”  as 
some  HEY  in  such  a  way  as  to  preserve  the  accessing 
characteristics  of  G.  Furthermore,  we  may  want  such  a 
representation  of  G  to  preserve  proximity ;  that  is,  the 
representation  of  G  in  H  should  be  such  that  adjacent 
nodes  in  G  are  represented  within  some  bounded 
distance  of  each  other  in  H .  The  need  for  preserving 
proximity  in  this  fashion  has  been  defended  by  Rosen¬ 
berg  (6). 

The  combinatorial  aspects  of  this  sort  of  represen¬ 
tation  are  recovered  by  the  following  definition  (see 
Figure  2); 

Definition.  Let  G  =  (V,  £)  and  G*  =  (V*,  £*)  be 
graphs,  and  let  T  a  1 .  We  say  that  G  is  T -embeddable 
in  G*  (written  G  £T  G*)  if  there  is  an  injection  4>  :V 
— »  V*  (called  an  embedding)  such  that  for  all  {*,  y)  E 
£,  dc-Wx),  4>(y))  £  T. 

Example.  Let  Kn  =  ({*,,  ....  *„},  £)  be  the 
complete  graph  on n  nodes;  i.e.  {*„*,)  E  £  for  all  1  s 
if  j  s  n  withi  *  j.  Let  H  be  the  graph  shown  in  Figure 
3.  Then  it  is  easily  verified  that  Kn  £,  H  via  the 
embedding  <t>(x<)  =  y,  fori  =  1,  .  .  .  f  n. 


Fig.  I .  Graph  G, 
x„  Xl2  X,3 


Fig  2.  Illustration  of  G  sT  G*. 


A  main  result  of  |4]  is  that  arrays  cannot  be  stored 
as  binary  trees  with  only  bounded  loss  of  worst-case 
proximity.  The  precise  statement  of  this  result  is 
Theorem  1.  Let  {G.},,*,  be  the  class  of  n  x  n 
arrays.  If  G„  H  for  some  binary  tree  H,  then 

T(n)  ^  logn  -  3/2. 

For  completeness,  we  sketch  the  proof  of  Theorem 
1  (cf.  |4]).  We  require  an  additional  definition,  which 
will  also  be  useful  in  a  later  result. 

Definition.  Let  G  =  (V,  £)  be  a  graph  and  let  A  C 
V.  The  boundary  fl(A)  of  A  is  the  set  of  vertices  in  A 
adjacent  to  vertices  in  V  -  A,  i.e. 

d(A)  =  {x  E  A  .  for  some y  E  V  -  A,  {x,  >'}  £  £}. 

A  key  property  of  the  boundary  operator  is  given 
by  the  following  lemma  (see  |4]). 

Boundary  Lemma.  If  G„  =  (V,,  £„)  is  an  array 
and  A  Q  Vm  is  such  that  |  A  |  s  n2/2,'  then 

\A  |  s  2|0(A)|’. 

Proof  of  Theorem  1 .  Assume  G„  snil  H  where 
G,  =  (V,,  £„),  and  let  H  =  (K,  £)  be  as  shown  in 
Figure  4,  where  are  subtrees  of  H  and 

Hk  =  (V*\  £“')  has  the  following  property: 

rt*/4  £  |{*  E  V„:  <J>(x)  E  V’*’}|  £  r»Y2. 

Let  Ak  denote  {*  £  V',:  4>(x)  £  Vw'1}.  Then  |A*| 
satisfies  the  hypothesis  of  the  Boundary  Lemma,  and 
thus 

|a(A*)|  a  (l/V2)|A*r  ^  nlltfl. 

By  definition,  if*  £  d(A*),  thendc>(*,y)  =  1  for  some 
y  £  V  -  Ak,  and  thus,  since  G„  £„*,  H, 

dj* I>(x),  <1 >(y»  £  T(n). 

But 

|{v  E  V'*’:dH(u,  v)  £  Tip) 


‘  If  S  is  a  Kt.  |S|  denotes  the  cardinality  of  S 


a _ , 

is _ _ 
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Fig  3  Graph  H 


r 


Fig  4  Decomposition  of  H 


H, 


Hk_,  Hk 

for  some  u  G  V  -  £  2T|’"; 

therefore 

7'-  2  |d(A*)|  a  n/2J2, 
or 

T(n)  2  log/I  -  3/2.  □ 

Actually,  in  |2|  we  prove  a  much  stronger  result:  If 
we  allow  embeddings  ofn  x  n  arrays  which  not  only 
preserve  proximity  by  a  factor  of  T(n)  but  which  also 
may  “split"  a  vertex  at  most  S(n)  times,  then 


linear  structure,  such  as  a  linear  list  or  in  linear 
memory. 

Theorem  2.  Let  {G,},*,  be  the  class  of  n  x  n 
arrays.  If  G.  L  for  a  line  L ,  then 

A(n )  2  n/12. 

Proof.  Assume  G  sj}*,  L  where  L  =  ({*„  .  .  .  , 
Jr.),  £)  is  a  line.  It  is  clearly  sufficient  to  assume  that 
m  =  n1  since,  if  G„  SJJJ,  L  by  an  embedding  <t>  andxk 
£  <I»( V„),  then  4>  also  defines  an  A(n)  average  embed¬ 
ding  of  G„  into  the  line  with**  removed. 

Let  Dt  -  {O-'fo):  1<;£  /'}  for  l<i£  n2/ 2  so 
that  |  £>i|  =  i  £  n*/2  (see  Figure  5).  For  each y  E  rt(Dt) 
there  is  an  edge  between  y  and  some  node  not  in  D,  \ 
hence  there  is  a  path  between  4>(y)  and  some  node  not 
in  <t>(D,)  which  must  pass  through  x,.  Since  |D,|  s 
n2/ 2,  by  the  Boundary  Lemma  each  Q>(Dt)  makes  a 
contribution  of  at  least  (|Dj|/2),a  to 

2  dc-CPOr),  <I>(y)). 

For  suppose  <t>  *(Jc*)  and  <I>~  V,)  (k  <  l )  are  adjacent 
in  G„ .  Then  the  I  -  k  +  1  that  this  adjacency  should 
add  to  the  sum  accrues  by  x„'s  membership  in  <P{Dk), 

. 4>(D|- 1) .  each  membership  contributing 

1  to  the  sum . 


Therefore 


I  2 

£  dR.(«J>(x).  <Wy)>  2  |D,r 

VZ  i-i 


Hence 


1  *  |  rm*H  3 

'viSv'!!v5l 


T(n )  +  log  log  S(n )  2  c  •  log  n 

where  c  is  a  fixed  positive  constant  independent  ofn  . 

3.  Preserving  Average  Proximity 

The  relation  sr  represents  a  worst-case  analysis  of 
proximity-preserving  transformations.  Since  data  struc¬ 
tures  are  frequently  accessed  "uniformly"  in  that  the 
probability  of  access  to  a  particular  node  by  a  particular 
edge  is  uniformly  distributed .  Theorem  1  leaves  open 
the  question  of  whether  or  not  there  are  ways  to  store 
arrays  as  various  other  structures  which ,  on  the  average , 
preserve  proximity.  To  investigate  this  problem  we 
need  an  additional  definition. 

Definition.  Let  G  =  (V,  £)  and  G*  =  (V*,  £*)  be 
graphs.  We  say  that  G  is  A -average  embeddable  (writ¬ 
ten  G  sy«  G*)  if  there  is  an  embedding  •DiV'  -*  V* 
such  that 

Z  <M<t>(jr),<t>(y»sA|£|. 

I  f.HKS 

We  first  consider  the  case  of  storing  an  array  as  a 


nV 6  s  A(n)|£„|  =  A(n)(2n2  -  2r»), 

which  yields  A(n)  2  n/12.  □ 

An  interesting  interpretation  of  this  result  is  that 
any  reasonable  sequential  method  of  array  storage  is 
asymptotically  optimal  with  respect  to  proximity.  Con¬ 
sider.  for  instance,  ar.  embedding 4>  of  G«  into  a  line  L 
which  places  nodes  of  G„  in  row-major  order;  that  is. 
4>(X|j)  =  Xi-i+u-im  ■  With  this  embedding  intoL, 

s(V*,m  L. 

By  Theorem  4.2  of  |6],  row-major  storage  is  also 
optimal  for  worst-case  proximity. 

In  contrast  to  Theorems  1  and  2.  we  have  the 
following. 

Theorem  3.  When  n  is  a  power  of  2,  there  is  a 
binary  tree  H  such  that  C,  £gv"  H. 

Proof.  Let  G,  =  (V„  £«)  be  given  and  suppose  n 
=  7  for  some  k  .  We  shall  describe  a  recursive  method 
of  embedding  G „  into  a  complete  binary  tree.  Divide 
G„  into  fourn/2  *  n/2  subarrays  (see  Figure  6(a))  and 
attach  the  subanays  as  leaves  of  a  complete  binary  tree 
H  as  shown  in  Figure  6(b)). 
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Fig.  5  Embedded  edges  in  L  . 


♦  ,  x, 


♦  ID, i 


Fig.  6.  llluslraiian  of  embedding  of  Theorem  3.  (a)  Recursive 
decomposition  of  C, .  (b)  Embedding  G„  into  complete  binary  tree 


•y? 


n 


A„  =  |V„.E„I 


Clearly 

U-.vieC.  ^2) 

<4  y  UtlflEf  If 

where  iV  is  the  sum  of  the  lengths  of  the  2 1  paths 
between  the  4n  -  4  nodes  along  the  boundaries  of  the 
Au.  By  continuing  this  process  recursively,  we  can 
suppose  that  the  Au  of  Figure  6(b)  are  themselves 
complete  binary  trees,  and  therefore  we  may  bound  N 
in  inequality  (2)  by 

N  s  2/i[2  log(n’/4)  +  4]  =  8n  logn . 

This  leads  to  the  following  recurrence  for f[n),  the  sum 
of  the  lengths  of  paths  which  correspond  under  <t>  to 
embedded  edges  of  an  n  x  n  array  : 

AV  =  0 

An)  s  4f(n/2)  +  8n  logn  . 

The  solution  to  this  recurrence  satisfies 


An)  s  16(4"*")  -  8n  logn  -  16n. 
Thus 


A  =  2  *«•<*>.  ♦<r»/l*. 

le.toet.  / 


2(n*  -  n) 


s  -  4n  log  w  -  8n  s  g 
tf  -  n 


□ 


For  n  not  a  power  of  2,  Theorem  3  clearly  can  be 
modified  to  hold  with  a  slightly  larger  constant. 

Even  though  the  number  of  vertices  reachable  by  a 


path  of  length  k  grows  as  ?  in  a  complete  binary  tree 
versus  A*  in  an  array,  it  is  important  to  note  that  not 
any  embedding  technique  will  work  in  the  proof  of 
Theorem  3.  Similarly,  not  every  graph  in  which  fewer 
than  2*  vertices  can  be  reached  by  paths  of  length  k 
can  be  embedded  by  using  the  recursive  decomposition 
of  Theorem  3;  the  graph  must  have  the  property  that 
it  can  be  “cut”  into  regions  with  boundaries  that  are 
not  too  large.  A  basic  question  to  be  resolved  is 
whether  or  not  any  family  of  graphs  with  neighbor¬ 
hoods  growing  slower  than  2*  are  A  -average  embedd¬ 
able  in  trees  for  some  constant  A  . 
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Page  626,  first  paragraph,  second  sentence:  replace 
after  “communications”  by  In  column  I,  line  16 
should  begin  “...Considerable...”.  In  Column  2,  the  first 
term  of  equation  (2)  should  be  a,i  and  the  first  word  that 
follows  that  equation  is  “of.  On  page  627,  the  title  to 
section  2.5  begins  with  “Sorting.” 

On  page  628,  in  column  2,  line  21.  p(s)  should  be  jr(S). 
In  column  2  of  page  629.  line  15  has  a  comma  before 
“holds”  and  on  line  18  “E.  Blum”  should  be  “M.  Blum.” 
On  page  630,  column  I,  the  expression  on  the  first  and 

second  lines  should  read  ”oo(oo,. . .  ,dn) . <Mao . a«)”,  and 

on  tine  8  the  author  is  Motzkin.  On  page  631,  column  I, 
line  9,  “0(n*)”  should  be  ”o(n*)”.  In  column  2,  line  12 
should  read  “such  that  for  every  no  <  n"  and  line  14 
should  read  “satisfying  (1)  f  (H)  “  n,  and  (2)...”.  The 
theorem's  statement  should  end  with  “Here  f  (H)  de¬ 
notes  the  length  of  H .” 

Line  4  of  column  I  of  page  633  should  read  “secrecy.”, 
and  the  following  paragraph  should  end  with  a  question 
mark. 
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A  graph,  G,  consists  of  vertices  V(G)  and  edges  E(G);  paths  are  sequences 
of  vertices  connected  by  edges,  and  path  length  is  defined  by  the  number  of 
edges  along  the  path.  For  x,y  e  V(G)  we  use  dG(x,y)  to  denote  the  length  of  a 
minimal  length  path  between  x  and  y,  if  such  a  path  exists.  An  n  *  n  array, 
Gn,  consists  of  vertices  V(Gn)  =  {x^^  j<n  and  edges  which,  except  at  the 
obvious  extremal  conditions,  are  linked  as  follows: 


*  The  work  of  both  authors  was  supported  in  part  by  the  U.S.  Army  Research 


Office,  Grant  No.  DAA029-76-G-0338. 
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(xi,r  Xi+l,j)  e  E(0n)*  3,1(3 


XiJ+l 


)  e  E(Gn). 


Such  graphs  are  also  called  rook-connected .  A  binary  tree  is  as  defined 
in  [1,2];  that  is,  a  binary  tree  H  is  a  connected  acyclic  graph  with  a 
designated  root  and  ancestor  -  descendent  relation  defined  so  that  each 
x  e  V(H)  has  at  most  two  immediate  descendants . 

Let  us  write  G  <_  when  there  is  a  one-one  mapping  (called  an  em¬ 
bedding  of  G  into  H  4>  :  V(G)  -*■  V(H),  such  that  for  all  (x,y)  e  E(G), 


dH(*(x),  *(y))  <  T. 

As  described  in  [1],  it  follows  frcm  simple  volumetric  arguments  that  for 
all  T  >  0,  there  exists  a  binary  tree  H  such  that  H  £  ^Gn,  for  all  n  >  1. 
The  corresponding  intuition  for  doer  not  hold.  It  would  now  seem 
that  since  in  G 


|  {x  e  V(G  ):  dq  (x,y)  <  k)  |  *  0(k‘  )  (1) 

1  'n 

while  in  a  complete  binary  tree  H 

|  (x  e  CH):  dH  (x,y)}  |  >  2k_1  (2) 

that  Gn  £  would  now  be  pos.  ^ble  for  some  bounded  T.  It  is  therefore 
somewhat  surprising  that  G^  <_  only  if 

T  >  log  n  -  1.5 


(See  [1],  for  details). 
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It  is  still  obvious  from  inspection  that  neighborhoods  in  trees  can  be 
much  more  densely  growing  than  neighborhoods  in  arrays,  and  therefore  by 
choosing  a  suitably  global  measure  of  loss  of  proximity,  this  difference 
should  be  distinguishable.  In  [2]  we  considered  such  a  measure: 

G  <  ec^’e  G*  if  for  sane  embedding  4>  :  V(G)  ->■  V(G*) 


J<iG*(*(x),  *(y»  <  A  |  E(G)  |. 

(x,y)  e  E(G) 

It  follows  [2]  that  for  b  =  8.5 

G  <  edSe  H 
n"  b 

+ 

for  some  binary  tree  H.  This  upper  bound  can  be  inproved  to  b  *  7  -  o(l) 
The  relation  <  ec^e  may  be  thought  of  as  averaging  -  with  relative 
frequencies  uniformly  distributed  to  the  edges  E(G)  -  over  the  edges  of  G. 
We  now  make  a  more  global  definition  which  finally  may  be  used  to  recover 
our  original,  although  imprecise,  intuitions  about  path  lengths  in  binary 
trees.  We  will  essentially  average  over  shortest  paths: 

G  <  pa^hs  g*  if  one  Is  an  embedding  $  :  V(G)  **■  V(G*)  such  that 


where 

rn  =2  dQ*  WA.SCy)) 
*(x),*(y) 


t  L.  Snyder,  private  conmunication. 


48 


and 


A  =  V  dn(x,y). 


n  ^  G 

x,y 


We  then  have  the  following  theorem. 


Theorem.  For  each  n  >  0,  let  A  be  the  least  real  number  such  that 
-  _  »  n 


p  paths  „ 
i  An  "• 


for  a  binary  tree  H.  Then 


lim  A  =  11m  T  /  A  =0. 

n 

n  -*■  00  n  -*■  00 


Proof  we  first  show 

An  = 

Let  us  choose  B^,  Bp  £  V(Gn)  so  that 

B1  =  *xij  •'  1  £  i>  J  £  n  / 

B2  =  ^xij  :  —  £  1»i  £  n> 

4 

so  that  |B1  x  Bp |  =  [n  /  16]2 .  Now  clearly,  for  any  (x,y)  e  *  Bp 

dG  ^x»y)  i  n  /  2, 


and  hence  by  definition 


A  >  n"*  /  512  =  ft(n5) 


n  — 


We  now  obtain  the  following  upper  bound  for  f 


n 


rn  =  0(n  log  n). 


As  in  [2]  let  Ay  c  V(Gn),  1  <  i,  j  <  2,  | A± j |  =  n  /  4, 


Denote  the  n  /  2  x  n  /  2  decomposition  of  G^  and  notice  that 


f(n)  <  4  r 


(?)  +  ?  n‘ 


log  n. 


4  4 

Thus  f(n)  <  an  log  n  +  gn  from  Which  the  theorem  follows  directly. 
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INTRODUCTION 

One  of  the  most  significant  changes  in  theoretical  computer  science 
has  been  the  recent  infusion  of  the  methods  and  problems  from  combinatorial 
analysis.  Among  the  most  powerful  combinatorial  theorems  which  have  been 
imported  to  computer  science  are  those  of  extremal  graph  theory  [1]:  in 
extremal  graph  theory,  one  is  interested  in  the  largest  (or  in  complementary 
problems,  the  smallest)  graph  which  avoids  (or  contains)  a  given  structure. 
Purely  combinatorial  results  (which  have  significance,  e.g.,  for  the  design 
of  circuit  boards)  have  been  obtained  by  Chung  and  Graham  [2]  and  by  Chung, 
Graham,  and  Pippenger  [3].  In  this  paper,  we  extend  this  theory  to 
encompass  results  concerning  data  structures. 

As  motivation  for  the  results  to  the  described,  note  that  many  of  the 
large  data  structures  manipulated  by  the  programs  described  in  [4,5]  have 
two  characteristics 

(i)  they  are  sequentially  accessed,  and 

(ii)  many  distinct  structures  convolve  in  the 
same  physical  memory. 

For  applications  of  this  sort,  it  would  obviously  be  desirable  to  have 
available  a  universal  data  structure  in  which  all  data  structures  from  a 
given  class  may  gracefully  reside.  In  view  of  (i),  by  "graceful"  we  mean 
that  the  sequential  accessing  characteristics  of  the  embedded  data  structures 
are  not  too  drastically  altered.  Let  us  measure  such  alterations  by  the 
dilation  of  logical  adjacencies  [6,7]  needed  to  embed  all  structures  from  a 
given  class  into  a  universal  structure;  this  is  then  a  complementary 
extremal  graph  theory  problem:  what  is  the  size  (number  of  edges)  of  the 
smallest  universal  graph  for  a  given  dilation  factor. 
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The  main  results  contained  In  this  paper  address  such  problems  from 
a  number  of  points  of  view. 

(1)  We  give  several  asymptotically  optimal  universal  data 
structures  for  graphs  of  n  vertices  when  average  dilation 
[7]  is  used  as  a  measure. 

(2)  We  discuss  a  universal  data  structure  for  graphs  of  n 
vertices  where  worst-case  dilation  is  used  as  a  measure  [6]. 

(3)  We  consider  variations  of  the  average  dilation  measure 
which  gives  favorable  comparisons  between  data  structures 
studied  in  [6,7]. 

(4)  We  consider  the  kinds  of  "sharing"  that  can  take  place 
between  "almost  linear"  and  "almost  complete  tree-like" 
structures. 

(5)  Finally,  we  propose  a  data  structure  embedding  model  which 
recovers  some  aspects  of  random  accessing  of  data  items, 
and  prove  a  space-time  tradeoff  which  seems  to  indicate 
that  no  savings  is  possible  in  RAM  models  which  assess 
accessings  costs  uniformly  [8]. 


PRELIMINARIES 

A  graph,  G,  is  defined  by  its  vertices,  V(G),  and  edges, 

E(G)  £  V(G)  *  V(G) .  Edges  are  assumed  to  be  undirected :  a  pair  of  vertices 

x,y  are  connected  if  either  (x,y)  e  E(G)  or  (y,x)  e  E(G).  A  path  between 

x„,  x  is  said  to  be  of  length  n.  The  distance  metric  d  (xn,  x  )  is  defined 
On  w  u  n 

to  be  n  if  there  is  no  shorter  path  than  Xg,...,xn. 

A  graph  represents  a  data  structure  in  the  obvious  way:  vertices 
represent  nodes  or  records  and  connectedness  models  logical  adjacency. 

The  following  relations  and  their  significance  for  data  structures  can  be 
found  in  [6,7].  Let  G,  G*  be  graphs.  We  say  that  G  is  T -worst  case 
embeddable  in  G*  (G^G*)  if  there  is  a  one-one  4> :  V(G)-*-V(G*)  such  that  (x,y) 
e  E(G)  implies 

d^($(x),  Hy))  <  T.  (1) 
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Similarly,  G  is  k-average  case  embeddable  in  G*  (G  <_  a^8  G*)  if  there 
is  a  one-one  $  as  above  such  that 

2  dc*($(x) »  *<y»  ±  A* lE(G) I •  (2) 

x.y 

connected 

In  [4,5],  comparisons  between  several  natural  classes  of  graphs  give 
asymptotic  bounds  on  T,  A  in  (1),  (2)  as  functions  of  |V(G)|.  Shortly 
after  the  announcement  of  the  results  of  [6],  R.  M.  Karp  suggested  to  us 
the  following  class  of  problems  connected  with  extremal  graph  theory: 
what  are  the  characteristics  of  < T  -  universal  data  structures;  i.e., 
those  structures  which  T -worst  case  embed  all  graphs  in  a  given  class. 

This  paper  grew  out  of  considering  these  problems. 

UNIVERSAL  GRAPHS 

Let  £n  be  a  given  class  of  graphs  G,  |v(G) [  ■  n.  Let  us  ask  about  a 
data  structure  which  is  _<T  or  <  universal  for  £n.  In  particular,  let 
us  define 

wUn,  T)  =  min  {|E(G)|:  Gn  e  £n,  Gn  <T  G>  (3) 

and 

a(4n,  A)  -  min  {|E(G)|:Gn  e  £n,  G°  <  aJ8  G}. 

For  T  =  1,  (3)  becomes  the  complementary  extremal  graph  problem  studied 
in  [2,3]. 

By  an  n-tree  G,  we  mean  a  connected  acyclic  graph  G,  with  |v(G)|  =  n. 
It  is  also  convenient  to  think  of  trees  as  rooted  in  the  following  sense: 
accompanying  G,  there  is  an  ancestor-descendent  relation  that  assigns 
direct  ancestors  and  direct  descendents  to  vertices  in  the  obvious  way  so 
that  a  vertex  with  no  ancestors  can  be  designated  as  the  root  of  the  tree. 


(Obviously  this  choice  is  not  going  to  be  unique,  but  we  assume  that  G 

is  not  characterized  until  such  a  choice  is  made) .  A  d-ary  n-tree  is  an 

n-tree  in  which  each  vertex  has  at  most  d  direct  descendents.  We  denote, 

respectively,  the  classes  of  n-trees  and  d-ary  n-trees  by  Tn  and  rn*  . 

d 

By  [2]  it  is  known  that  %n  log  n  <  w(rn,  1)  <  n^4^^,  k(n)  = 

[log  log  n)-1.  + 

The  upper  bound  was  improved  in  [3]  to 

w(rn,  1)  *=  0(n  log  n[log  log  n]2) 

The  bounds  on  a(Fn,  1)  are  apparently  not  elsewhere  considered. 

Superficially,  at  least,  all  interest  in  further  characterization  of 
(3)  is  destroyed  by  the  following  obvious 
Theorem.  For  T  2 

w(rn,  T)  =  n 

Of  course,  in  (3),  the  "target"  graph  G  may  have  unbounded  degree. 
Therefore,  it  is  natural  to  consider  w(?n,  T,  S)  and  a(£n,  T,  S)  where  in 
both  cases  the  target  graph  G  is  restricted  to  be  in  the  set  S.  Note  that 
now  the  theorem  just  cited  is  no  longer  obviously  true. 


*  Thus  r"  *  binary  trees  on  n  vertices. 

In  the  sequel,  we  use  log  x  for  log^x  and  £.nx  for  loggx. 


+ 
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The  best  that  is  known  is  the  upper  bound  of  [3]  (S  ■  all  cubic 
graphs) 

w(rn,  1,  S)  <  ^~exp  (log2n/2  log  2)  .  (4) 

It  is  not  obvious  that  when  (i)  "targets"  are  restricted  to  binary  trees 

and  (ii)  w(r",  T,  r”)  is  considered,  that  it  is  possible  to  do  any  better 

than  the  union  of  all  trees  in  r",  giving  a  structure  of  size  4n / £  . 

But,  we  have  the  following 

Theorem.  For  each  T  ^  1,  there  is  a  binary  tree  H,  such  that  G  H 
for  all  G  £  r",  and 

An|E(G)  |  <  ; 

or  in  other  words 

w(r£,  T,  r")  ■=  exp  ^  (£n  n)2  +  0((Jta  n)2)  . 

A  key  step  in  the  proof  of  this  theorem  hinges  on  the  solution  to 
the  fascinating  "almost  linear"  recurrence 

u„  ■  Vi +  “Til  •  <5) 

first  considered  by  Knuth  [9],  This  also  establishes  a  connection 
between  the  theorem  and  ineq.  (4):  ur  is  also  the  number  of  partitions 
of  2n  of  the  form  Ea^  2*,  «  0,  1.  Knuth  [9]  bounds  the  partition 

function 

1  2 

P(m)  -  •— -  exp  (ir  ■=■  m)  . 

4/"3m 
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There  are  two  possibilities  for  improving  the  bounds  in  w(r^,  T,  P”) . 

The  first  possibility  is  to  introduce  circuits  to  the  target  graph  of  the 
previous  theorem,  but  this  does  not  appear  to  give  an  asymptotically 
better  bound  than  (4) .  The  second  possibility  is  to  prove  that  balanced 
trees  and  unbalanced  trees  are  <  -  equivalent.  This  seems  unlikely  since 

combining  such  a  result  with  the  proof  method  of  the  previous  theorem 
gives  a  polynomial  sized  universal  tree.  However,  in  trying  to  Improve 
the  bounds  on  wCr”,  T,  it  may  be  desirable  to  ignore  irregular  trees, 
letting  only  very  balanced  or  very  unbalanced  trees  reside  in  the  same 
universal  data  structure. 

In  any  case,  it  seems  unlikely  that  polynomial  structures  are  possible. 
We  are,  however,  far  from  proving  this;  indeed,  the  best  known  lower  bound 
is  the  following 
Theorem.  For  all  n  >  N 

w(r!j,  T,  r")  >  c(T)  n  log  n  , 
where  c(T)  >  0  is  a  constant  for  fixed  T  1. 

..  + 

Certain  other  subcases  are  also  of  interest.  Erdos,  Chung,  and  Graham  , 
consider  w(S,l)  and  obtain 

4  2 

w(S,l)  <  jjn  . 

The  following  theorem  is  an  improvement,  but  is  si  rely  not  the  best 
possible  bound. 

Theorem 

w(S,l)  <  |  n2 


t  Private  Communication. 
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A  non-trivial  lower  bound  would  clearly  be  desirable.  Another  class  of 
interest  are  graphs  of  high  genus.  We  conjecture  that  for  graphs  of 
fixed  genus  y»  it  is  possible  to  do  better  than  the  naive  (”)  bound 
obtained  by  embedding  in  the  complete  graph. 

Our  next  series  of  results  show  impressive  improvements  by  passing 
to  average  dilations.  We  now  get  optimal  constructions,  even  in  a  variety 
of  limited  settings. 

We  have,  for  instance,  the 
Theorem.  For  a  >  0, 

a(r2‘  a*  S)  =  0(nlog(2+a))  . 

Since  there  is  a  linear  lower  bound  on  a( •,*,*),  this  construction  is 
optimal.  By  a  slight  modification  of  the  construction,  this  gives 
a(r“.  A,  S)  *  0(n) ,  for  all  A  ^  1,  but  this  result  may  be  superceeded  by 
the  following 

Theorem.  For  each  A  >  1,  there  is  a  binary  tree  H,  such  that 

g<7h 

for  all  G  e  r”,  and 

I E(G) |  -  0(n)  ; 

or,  in  other  words 

a(I^,  A,  r")  «  0(n)  . 


tt  A  graph  is  of  genus  y  if  it  can  be  embedded  in  a  sphere  with  Y 
handles  [10]. 
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These  results  are  related  to  the  ability  to  "cut"  graphs  In 
advantageous  ways.  Tor  example,  a  generalization  of  the  planar  separator 
theorem  [11]  to  graphs  of  high  genus,  obtained  by  Lipton  and  Tarjan,  gives 
us  the  following 

Theorem.  Let  l"  be  the  class  of  graphs  G  with  genus  y  and  jV(G)|  =  n. 

Then,  for  all  n  >  N, 

a(L",  A,  r“)  <  c(A)*n  , 
where  c(A)  does  not  depend  on  n. 

EXTENDED  MODEL 

In  comparing  classes  of  data  structures  (see,  e.g.,  [6,7],  the  measures 
of  "efficiency"  have  implicitly  assumed  that  only  sequential  accessing  is 
Important.  Thus,  when  in  [6],  we  bound  the  efficiency,  T,  of  an  embedding 
of  n  x  n  array  into  binary  trees  by 

T  j>  c  log  n  , 

the  function  T(n)  captures  the  dilation  factor  in  an  embedding.  We  now 
describe  a  generalization  of  this  concept  which  recovers  a  certain  kind  of 
random  accessing.  Since  the  precise  definitions  are  quite  complex,  we  will 
settle  for  a  less  exact  —  but  more  picturesque  —  rendering.  Let  us  assume 
that  we  have  in  front  of  us  an  illustration  of  a  graph  G,  and  also  a  number 
of  friends  who  agree  to  lend  us  their  forefingers  for  use  in  tracing  the 
paths  of  the  graph.  Our  friends  oblige  us  as  follows:  We  may  start 
traversing  at  any  vertex  already  visited.  The  traversal  rule  is,  then, 
that  we  must  either  traverse  graph  edges  or  "jump"  to  a  vertex  pointed  to 


by  a  friend.  The  time  required  Co  traverse  a  sequence  of  vertices  is  then 
simply  the  number  of  applications  of  traversal  rules.  Notice  that  the 
result  of  a  traversal  is  not  necessarily  a  path  of  G.  The  connection 
between  fingers  and  random  accessing  is  that  traversals  requiring  k-fingers 
also  require  k-" addresses"  for  the  vertices  pointed  to. 

We  then  say  that  G  <_  ^  ^  G*  if  there  is  a  one-one  4>:V(G)  V(G*) ,  so 

that  for  every  x,y  £  V(G)  with  dG(x,y)  ■  m,  there  is  a  k-finger  traversal 
from  $(x)  *  x*  to  $(y)  =  y*  with  time  at  most  A,  and  A  <  Td„.(x*,  y*) . 

We  have  the  following 

Theorem.  If  G^  is  the  n  *  n  array  [7],  H  is  a  binary  tree 
and 


G  <  ,  .  H  , 

n  —  k,l(n) 

then 


k  +  T(n)  >_  c  log  n  , 
where  c  is  a  constant  independent  of  n. 


OTHER  TYPES  OF  AVERAGE  EMBEDDING 

The  relation  <_  a^g  may  be  thought  of  as  averaging  -  with  relative 

frequencies  uniformly  distributed  to  the  edges  E(G)  -  over  the  edges  of  G. 

We  now  make  a  more  global  definition  which  may  be  used  to  recover  our 

intuitions  about  path  lengths  in  binary  trees  [7].  We  will  essentially 

average  our  shortest  paths: 
paths 


G  < 

A 


G*  if  there  is  an  embedding  4>:V(G)  V(G*)  such  that 


We  then  have  the  following 


Theorem.  For  each  n  >  0.  let  A  be  the  least  real  number  such  that 

—  n 

G  <  Path8  H  , 
n  A 

for  a  binary  tree  H.  Then 

11m  A  *  0 
n-w  n 

Thus,  we  see  that  if  the  average  embedding  is  required  to  work  well  on 

all  shortest  paths,  then  the  embedding  cost  goes  to  zero.  In  a  sense, 

then  <  aVg  "charges"  more  heavily  than  £  Paths  £or  any  bottlenecks. 

A  A 


REFERENCES 

[1]  P.  Erdos  and  J.  Spencer.  Probabilistic  Methods  in  Combinatorics. 
Academic  Press,  19 74. 

[2]  F.  R.  K.  Chung  and  R.  L.  Graham.  On  Graphs  Which  Contain  Small 
Trees.  To  appear  in  JCT. 

[3]  R.  R.  K.  Chung,  R.  L.  Graham,  and  N.  Pippenger.  On  Graphs  Which 
Contain  All  Small  Trees ,  II.  To  appear  in  JCT. 

[4]  M.  Minsky.  Semantic  Information  Processing.  MIT  Press,  1969. 

15]  L.  Uhr,  Pattern  Recognition  Learning  and  Thought.  Prentice-Hall, 
1973. 

[6]  R.  J.  Lip ton,  S.  C.  Eisenstat,  and  R.  A.  DeMillo.  Space  and  Time 
Hierarchies  for  Classes  of  Control  and  Data  Structures.  I ACM  23(4): 
720-732,  October  1976. 

[7]  R.  A.  DeMillo,  S.  C.  Eisenstat,  and  R.  J.  Lipton.  Preserving 
Average  Proximity  In  Arrays.  CACM ,  Vol.  21(3) : 228-231,  March  1978. 

[8]  A.  V.  Aho,  J.  E.  Hopcroft,  and  J.  D.  Ullman.  The  Design  and 
Analysis  of  Computer  Algorithms.  Addison-Wesloy,  1976. 

[9]  D.  E.  Knuth.  An  Almost  Linear  Recurrence.  The  Fibonacci  Quarterly, 
Vol.  4(1) : 117-128,  February  1966. 

(10]  F.  Harary.  Graph  Theory.  Addison-Wesley,  1972. 

[11]  R.  Lipton  and  R.  Tarjan.  A  Planar  Separator  Theorem.  Stanford 
Research  Report  CS-77-627,  October  1977. 


61 


■v  l 


SIAM  J.  APfL  MATH 
V«iM.No-2.A*rill9?9 


O  IfTf  Sooty  fof  InduMrial  and  Applied  Mathc  outset 
00)6-1399/79/3602-0001  SOI. 00/0 


A  SEPARATOR  THEOREM  FOR  PLANAR  GRAPHS* 

RICHARD  J.  LIPTONt  and  ROBERT  ENDRE  TARJANt 


Alien  ail  Let  C  be  any  n  -vertex  planar  graph .  We  prove  that  the  vertices  of  G  can  be  partitioned  into 
three  sets  A.  B,  C  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  contains  more 
than  2n/3  vertices,  and  C  contains  no  more  than  I'Jl'fn  vertices.  We  exhibit  an  algorithm  which  finds  such 
a  partition  A,  B.  C  in  0(n)  time. 


1.  Introduction.  A  useful  method  for  solving  many  kinds  of  combinatorial  pro¬ 
blems  is  “divide-and-conquer”  [1].  In  this  method  the  problem  of  interest  is  divided 
into  two  or  more  smaller  problems.  The  subproblems  are  solved  by  applying  the 
method  recursively,  and  the  subproblem  solutions  are  combined  to  give  the  solution  to 
the  original  problem.  Three  things  are  necessary  for  the  success  and  efficiency  of 
divide-and-conquer:  (i)  the  subproblems  must  be  of  the  same  type  as  the  original  and 
independent  of  each  other  (in  a  suitable  sense);  (ii)  the  cost  of  solving  the  original 
problem  given  the  solutions  to  the  subproblems  must  be  small;  and  (iii)  the  sub¬ 
problems  must  be  significantly  smaller  than  the  original.  One  way  to  guarantee  that 
the  subprobiems  are  small  is  to  make  them  all  roughly  the  same  size  [1]. 

We  wish  to  study  general  conditions  under  which  the  divide-and-conquer 
approach  is  useful.  Consider  problems  which  are  defined  on  graphs.  Let  5  be  a  class  of 
graphs1  closed  under  the  subgraph  relation  (i.e.,  if  Gi  e  S  and  Gi  is  a  subgraph  of  Gu 
then  Gj  e  S).  An  f(n)- separator  theorem  for  5  is  a  theorem  of  the  following  form: 

There  exist  constants  a  <  1,  0  >0  such  that  if  G  is  any  n -vertex  graph  in  S,  the 
vertices  of  G  can  be  partitioned  into  three  sets  A,  B,  C  such  that  no  edge  joins  a 
vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  contains  more  than  an  vertices,  and  C 
contains  no  more  than  /3 /(«)  vertices. 

If  such  a  theorem  holds  for  the  class  of  graphs  S,  and  if  the  appropriate  vertex 
partitions  A,  B,  C  can  be  found  fast,  then  a  number  of  problems  defined  on  graphs  in 
S  can  be  solved  efficiently  using  divide-and-conquer.  For  a  given  graph  G  in  S,  the 
sets  A  and  B  define  the  subproblems.  The  cost  of  combining  the  subproblem  solutions 
is  a  function  of  the  size  of  C  (and  thus  of  f(n)). 

Previously  known  separator  theorems  include  the  following: 

(A)  Any  n -vertex  binary  tree  can  be  separated  into  two  subtrees,  each  with  no 
more  than  2n/3  vertices,  by  removing  a  single  edge.  For  an  application  of  this 
theorem,  see  (13). 

(B)  Any  n -vertex  tree  can  be  divided  into  two  parts,  each  with  no  more  than 
2n/3  vertices,  by  removing  a  single  vertex. 

(C)  A  grid  graph  is  any  subgraph  of  the  infinite  two-dimensional  square  grid 
illustrated  in  Fig.  1.  A  Vn-separator  theorem  holds  for  the  class  of  grid  graphs.  For  an 
application,  see  [5]. 
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Fig.  1.  Infinite  two-dimensional  square  grid. 


(D)  A  one-tape  Turing  machine  graph  [16]  is  a  graph  representing  the  compu¬ 
tation  of  a  one-tape  Turing  machine.  A  Vn-separator  theorem  holds  for  such  graphs. 
For  an  application,  sec  [15], 

One  might  conjecture  that  the  class  of  all  suitably  sparse  graphs  has  an  f(n)- 
separator  theorem  for  some  f(n)  =  o(n).  However,  the  following  result  of  Erdos 
Graham  and  Szemeredi  [4]  shows  that  this  is  not  the  case. 

Theorem  C.  For  every  e  >0  there  is  a  positive  constant  c  -  c(e)  such  that  almost 
all 2  graphs  G  with  n  =  (2  +  e)k  vertices  and  ck  edges  have  the  property  that  after  the 
omission  of  any  k  vertices,  a  connected  component  of  at  least  k  vertices  remains. 

Although  sparsity  by  itself  is  not  enough  to  give  a  useful  separator  theorem, 
planarity  is.  In  §  2  of  this  paper  we  prove  that  a  Vn-separator  theorem  holds  for  all 
planar  graphs.  In  S  3  we  provide  a  linear-time  algorithm  for  finding  a  vertex  partition 
satisfying  the  theorem.  This  algorithm  and  the  divide-and-conquer  approach  combine 
to  give  efficient  algorithms  for  a  wide  range  of  problems  on  planar  graphs.  Section  4 
mentions  some  of  these  applications,  which  we  shall  discuss  more  fully  in  a  subsequent 
paper. 

2.  Separator  theorems.  To  prove  our  results  we  need  to  use  three  facts  about 
planarity. 

Theorem  1  (Jordan  curve  theorem  (6)).  Let  C  be  any  closed  curve  in  the  plane. 
Removal  of  C divides  the  plane  into  exactly  two  connected  regions,  the  "inside"  and  the 
“ outside ”  of  C. 

Theorem  2  [7].  Any  n- vertex  planar  graph  with  n§3  contains  no  more  than  3  n  -  6 
edges. 


J  By  '‘almost  all'  we  mean  that  the  fraction  of  graphs  possessing  the  property  tends  with  increasing  n  to 
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(•)  (b) 

FlO.  2.  Kuratowski  subgraphs  (*)  Ks;  (b)  Kjj. 


Theorem  3  (Kuratowski's  theorem  [12]).  A  graph  is  planar  if  and  only  if  it 
contains  neither  a  complete  graph  on  five  vertices  (Fig.  2(a))  nor  a  complete  bipartite 
graph  on  two  sets  of  three  vertices  (Fig.  2(b))  as  a  generalized  subgraph. 

From  Kuratowski’s  theorem  we  can  easily  obtain  the  following  lemma  and  its 
corollary. 

Lemma  1.  Let  G  be  any  planar  graph.  Shrinking  any  edge  of  G  to  a  single  vertex 
preserves  planarity. 

Proof.  Let  G*  be  the  shrunken  graph,  let  (xt,  x2)  be  the  edge  shrunk,  and  let  x  be 
the  vertex  corresponding  to  x,  and  x2  in  G*.  If  G*  is  not  planar  then  G*  contains  a 
Kuratowski  graph  as  a  generalized  subgraph.  But  this  subgraph  corresponds  to  a 
Kuratowski  graph  which  is  a  generalized  subgraph  of  G.  Figure  3  illustrates  the 
possibilities.  □ 


or 


=b 


FlO.  3.  Shrinking  an  tdgt  to  form  a  Kuratowski  graph.  Original  graph  must  contain  a  Kuratowski  graph 
as  a  gtntralited  subgraph. 


Corollary  1.  Let  G  be  any  planar  graph.  Shrinking  any  connected  subgraph  of 
G  to  a  single  vertex  preserves  planarity. 

Proof.  The  proof  is  immediate  from  Lemma  1  by  induction  on  the  number  of 
vertices  in  the  subgraph  to  be  shrunk.  □ 

In  some  applications  it  is  useful  to  have  a  result  more  general  than  the  kind  of 
separator  theorem  described  in  the  Introduction.  We  shall  therefore  consider  planar 
graphs  which  have  nonnegative  costs  on  the  vertices.  We  shall  prove  that  any  such 
graph  can  be  separated  into  two  parts,  each  with  cost  no  more  than  two-thirds  of  the 
total  cost,  by  removing  0('/n)  vertices.  The  desired  separator  theorem  is  the  special 
case  of  equal-cost  vertices. 

Lemma  2.  Let  G  be  any  planar  graph  with  nonnegative  vertex  costs  summing  to  no 
more  than  one.  Suppose  G  has  a  spanning  tree  of  radius  r.  Then  the  vertices  of  G  can  be 
partitioned  into  three  sets  A,  B.  C,  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in 
B,  neither  A  nor  B  has  total  cost  exceeding  2/3,  and  C  contains  no  more  than  2r  +  \ 
vertices,  one  the  root  of  the  tree. 
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Proof.  Assume  no  vertex  has  cost  exceeding  1/3;  otherwise  the  lemma  is  true 
Embed  G  in  the  plane.  Make  each  face  a  triangle  by  adding  a  suitable  number  of 
additional  edges.  Any  nontrec  edge  (including  each  of  the  added  edges)  forms  a 
simple  cycle  with  some  of  the  tree  edges.  This  cycle  is  of  length  at  most  2r+  1  if  u 
contains  the  root  of  the  tree,  at  most  2r  -  1  otherwise.  The  cycle  divides  the  plane  (and 
the  graph)  into  two  parts,  the  inside  and  the  outside  of  the  cycle.  We  claim  that  at  least 
one  such  cycle  separates  the  graph  so  that  neither  the  inside  nor  the  outside  contains 
vertices  whose  total  cost  exceeds  2/3.  This  proves  the  lemma. 

Proof  of  claim.  Let  (x,  z)  be  the  nontree  edge  whose  cycle  minimizes  the  maxi¬ 
mum  cost  either  inside  or  outside  the  cycle.  Break  ties  by  choosing  the  nontree  edge 
whose  cycle  has  the  smallest  number  of  faces  on  the  same  side  as  the  maximum  cost.  If 
ties  remain,  choose  arbitrarily. 

Suppose  without  loss  of  generality  that  the  graph  is  embedded  so  that  the  com 
inside  the  (x,  z)  cycle  is  at  least  as  great  as  the  cost  outside  the  cycle.  If  the  vertices 
inside  the  cycle  have  total  cost  not  exceeding  2/3,  the  claim  is  true.  Suppose  the 
vertices  inside  the  cycle  have  total  cost  exceeding  2/3.  We  show  by  case  analysis  that 
this  contradicts  the  choice  of  ( x ,  z).  Consider  the  face  which  has  (x,  z)  as  a  boundary 
edge  and  lies  inside  the  cycle.  This  face  is  a  triangle;  let  y  be  its  third  vertex.  The 
properties  of  (x,  y )  and  (y,  z )  determine  which  of  the  following  cases  applies.  Figure  4 
illustrates  the  cases. 
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1 )  Both  (x,  y )  and  (y.  z )  lie  on  the  cycle.  Then  the  face  (x,  y,  x  )  is  the  cycle,  which 
is  impossible  since  vertices  lie  inside  the  cycle. 

2)  One  of  (x,  y)  and  (y,  z)  (say  (x,  y))  lies  on  the  cycle.  Then  (y,  z)  is  a  nontree 
edge  defining  a  cycle  which  contains  within  it  the  same  vertices  as  the  original  cycle 
hut  one  less  face.  This  contradicts  the  choice  of  (x,  z ). 

3)  Neither  (x,  y )  nor  (y,  z )  lies  on  the  cycle. 

a)  Both  (x,  y)  and  (y,  z)  are  tree  edges.  This  is  impossible  since  the  tree  itself 
contains  no  cycles. 

b)  One  of  (x,  y)  and  (y,  z)  (say  (x,  y))  is  a  tree  edge.  Then  (y,  z)  is  a  nontree  edge 
defining  a  cycle  which  contains  one  less  vertex  (namely  y)  within  it  than  the  original 
cycle.  The  inside  of  the  (y,  z)  cycle  contains  no  more  cost  and  one  less  face  than  the 
inside  of  the  (x,  x)  cycle.  Thus  if  the  cost  inside  the  (y,  z )  cycle  is  greater  than  the  cost 
outside  the  cycle,  (y,  z)  would  have  been  chosen  in  place  of  (x,  z). 

On  the  other  hand,  suppose  the  cost  inside  the  (y,  z)  cycle  is  no  greater  than  the 
cost  outside.  The  cost  outside  the  (y,  z)  cycle  is  equal  to  the  cost  outside  the  (x,  z) 
cycle  plus  the  cost  of  y.  Since  both  the  cost  outside  the  (x,  z )  cycle  and  the  cost  of  y  are 
less  than  1/3,  the  cost  outside  the  (y,  x)  cycle  is  less  than  2/3,  and  (y,  x)  would  have 
been  chosen  in  place  of  (x,  x ). 

c)  Neither  (x,  y )  nor  (y,  x)  is  a  tree  edge.  Then  each  of  (x,  y)  and  (y,  x)  defines  a 
cycle,  and  every  vertex  inside  the  (x,  x )  cycle  is  either  inside  the  (x,  y )  cycle,  inside  the 
(y,  x)  cycle,  or  on  the  boundary  of  both.  Of  the  (x,  y)  and  (y,  x)  cycles,  choose  the  one 
(say  (x,  y ))  which  has  inside  it  more  total  cost.  The  (x.  y)  cycle  has  no  more  cost  and 
strictly  fewer  faces  inside  it  than  the  (x,  x)  cycle.  Thus  if  the  cost  inside  the  (x,  y)  cycle 
is  greater  than  the  cost  outside,  (x,  y)  would  have  been  chosen  in  place  of  (x,  x). 

On  the  other  hand,  suppose  the  cost  inside  the  (x,  y)  cycle  is  no  greater  than  the 
cost  outside.  Since  the  inside  of  the  (x,  x )  cycle  has  cost  exceeding  2/3,  the  (x,  y )  cycle 
and  its  inside  together  have  cost  exceeding  1/3,  and  the  outside  of  the  (x,  y)  cycle  has 
cost  less  than  2/3.  Thus  (x,  y)  would  have  been  chosen  in  place  of  (x,  x). 

Thus  all  cases  are  impossible,  and  the  (x,  x )  cycle  satisfies  the  claim.  □ 

Lemma  3.  Lei  G  be  any  n -vertex  connected  planar  graph  having  nonnegative 
vertex  costs  summing  to  no  more  than  one.  Suppose  that  the  vertices  of  G  are  partitioned 
into  levels  according  to  their. distance  from  some  vertex  v,  and  that  L(l)  denotes  the 
number  of  vertices  on  level  l.  If  r  is  the  maximum  distance  of  any  vertex  from  v,  let  r  + 1 
be  an  additional  level  containing  no  vertices.  Given  any  two  levels  l\  and  h  such  that 
levels  0  through  l\  —  l  have  total  cost  not  exceeding  2/3  and  levels  /2  +  1  through  r+  1 
have  total  cost  not  exceeding  2/3,  it  is  possible  to  find  a  partition  A ,  B,  C  of  the  vertices 
of  G  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  has  total 
cost  exceeding  2/3,  and  C  contains  no  more  than  Z.(/i)  +  JL(/2)+max  (0,  2(l2~li  -  1)} 
vertices. 

Proof.  If  /|  S/j,  let  A  be  all  vertices  on  levels  0  through  L~l,  B  all  vertices  on 
levels  fi  +  1  through  r,  and  C  all  vertices  on  level  l\.  Then  the  lemma  is  true.  Thus 
suppose  /i</2.  Delete  the  vertices  in  levels  l\  and  /2  from  G.  This  separates  the 
remaining  vertices  of  G  into  three  parts  (all  of  which  may  be  empty):  vertices  on  levels 
0  through  /» —  1.  vertices  on  levels  l\  + 1  through  f2  —  1,  and  vertices  on  levels  /2  + 1  and 
above.  The  only  part  which  can  have  cost  exceeding  2/3  is  the  middle  part. 

If  the  middle  part  does  not  have  cost  exceeding  2/3,  lef  A  be  the  most  costly  part 
of  the  three,  let  B  be  the  remaining  two  parts,  and  let  C  be  the  set  of  vertices  on  levels 
/i  and  /j.  Then  the  lemma  is  true. 

Suppose  the  middle  part  has  cost  exceeding  2/3.  Delete  all  vertices  on  levels  /2 
and  above  and  shrink  all  vertices  on  levels  l\  and  below  to  a  single  vertex  of  cost  zero. 
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These  operations  preserve  planarity  by  Corollary  1.  The  new  graph  has  a  spanning 
tree  of  radius  /2-/i  - 1  whose  root  corresponds  to  vertices  on  levels  h  and  below  in 
the  original  graph. 

Apply  Lemma  2  to  the  new  graph.  Let  A*,  B*,  C *  be  the  resulting  vertex 
partition.  Let  A  be  the  set  among  A*  and  B*  having  greater  cost,  let  C  consist  of  the 
vertices  on  levels  t\  and  /2  in  the  original  graph  plus  the  vertices  in  C*  minus  the  root 
of  the  tree,  and  let  B  contain  the  remaining  vertices  in  G.  By  Lemma  2,  A  has  total 
cost  not  exceeding  2/3.  But  A  U  C*  has  total  cost  at  least  1/3,  so  B  also  has  total  cost 
not  exceeding  2/3.  Furthermore  C  contains  no  more  than  L(h)+ L(l2)+2(l2- lx~  \) 
vertices.  Thus  the  lemma  is  true.  □ 

Theorem  4.  Let  G  be  any  n -vertex  planar  graph  having  nonnegative  vertex  costs 
summing  to  no  more  than  one.  Then  the  vertices  of  G  can  be  partitioned  into  three  sets  A. 
B,  C  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  has  total 
cost  exceeding  2/3,  and  C  contains  no  more  than  l-Jl'fn  vertices. 

Proof.  Assume  G  is  connected.  Partition  the  vertices  into  levels  according  to  their 
distance  from  some  vertex  t>.  Let  L(l)  be  the  number  of  vertices  on  level  l.  If  r  is  the 
maximum  distance  of  any  vertex  from  v,  define  additional  levels  -1  and  r+ 1  contain¬ 
ing  no  vertices. 

Let  / 1  be  the  level  such  that  the  sum  of  costs  in  levels  0  through  l\-l  is  less 
than  1/2,  but  the  sum  of  costs  in  levels  0  through  /i  is  at  least  1/2.  (If  no  such  l\  exists, 
the  total  cost  of  all  vertices  is  less  than  1/2,  and  B  =  C  =  0  satisfies  the  theorem.)  Let 
ft  be  the  number  of  vertices  on  levels  0  through  l\.  Find  a  level  l0  such  that  l0£lt  and 
|L(/0)|  +  2(/1-io)g2v^.  Find  a  level  l2  such  that  /i  +  lg/2  and  |L(/2)|  +  2(/2-/i - 1)§ 
2  yfit-k.  If  two  such  levels  exist,  then  by  Lemma  3  the  vertices  of  G  can  be  partitioned 
into  three  sets  A,  B,  C  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither 
A  nor  C  has  cost  exceeding  2/3,  and  C  contains  no  more  than  2(V&  +  >/ n  -  k)  vertices. 
But  ll'fk Wn  -fc)§  2(V n/2  +  Vn/2)  =  l-JlJn.  Thus  the  theorem  holds  if  suitable 
levels  l0  and  l2  exist. 

Suppose  a  suitable  level  l0  does  not  exist.  Then;  for  i  S  L(i)£2->/k-2(l,-i). 
Since  L(0)=1,  this  means  lg2'/*-2f|,  and  li+l/2&\/k.  Thus  lI=L/l  +  l/2j£ 
iVi] ,  and 

t(/)5  i  2Vh-2(/l~i)S(4y/k-2ly/kJ)(lVkJ+l)/2sVk(lVkj+l)>k. 

i-o  j— rt— ivTj 

This  is  a  contradiction.  A  similar  contradiction  arises  if  a  suitable  level  l2  does  not 
exist.  This  completes  the  proof  for  connected  graphs. 

Now  suppose  G  is  not  connected.  Let  Gu  G2,  •  •  ■ ,  Gk  be  the  connected 
components  of  G,  with  vertex  sets  Vx,  V2,  •  -  • ,  14,  respectively.  If  no  connected 
component  has  total  vertex  cost  exceeding  1/3,  let  i  be  the  minimum  index  such  that 
the  total  cost  of  V,  U  V2U  •  •  •  U  V,  exceeds  1/3.  Let  A  =  V,U  V2U  -  •  •  U  V*.  let 
B  *  V,+|U  V,«.2U  •  •  •  U  V4,  and  let  C  =  0.  Since  i  is  minimum  and  the  cost  of  V’, 
does  not  exceed  1/3,  the  cost  of  A  does  not  exceed  2/3.  Thus  the  theorem  is 
true. 

If  some  connected  component  (say  Gi)  has  total  vertex  cost  between  1/3  and  2/3. 
let  A  =  V(,  B  =  V,U  •  •  ■  U  V,_,U  V(*,U  •  •  •  U  V*.  and  C=  0.  Then  the  theorem  is 
true. 

Finally,  if  some  connected  component  (say  G.)  has  total  vertex  cost  exceeding 
2/3,  apply  the  above  argument  to  G,.  Let  A*,  B*.  C*  be  the  resulting  partition.  Let  A 
9  be  the  set  among  A*  and  B*  with  greater  cost,  let  C  =  C*,  and  let  B  be  the  remaining 
vertices  of  G.  Then  A  and  B  have  cost  not  exceeding  2/3  and  the  theorem  is  true. 
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This  proves  the  theorem  for  all  planar  graphs.  In  all  cases  the  separator  C  is 
either  empty  or  contained  in  only  one  connected  component  of  G.  □ 

Corollary  2  (vn-separator  theorem).  Let  G  be  any  n-vertex  planar  graph.  The 
vertices  of  G  can  be  partitioned  into  three  sets  A ,  B,  C  such  that  no  edge  joins  a  vertex  in 
A  with  a  vertex  in  B,  neither  A  nor  B  contains  more  than  2n/3  vertices,  and  C  contains 
no  more  than  l-Jlyfn  vertices. 

Proof.  Assign  to  each  vertex  of  G  a  cost  of  1/n.  The  corollary  follows  from 
Theorem  4.  □ 

It  is  natural  to  ask  whether  the  constant  factor  of  2/3  in  Theorem  1  can  be 
reduced  to  1/2  if  the  constant  factor  of  2^2  is  allowed  to  increase.  The  answer  is  yes. 

Corollary  3.  Let  G  be  any  n-vertex  planar  graph  having  nonnegative  vertex 
costs  summing  to  no  more  than  one.  Then  the  vertices  of  G  can  be  partitioned  into  three 
sets  A,  B,  C  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  has 
total  cost  exceeding  1/2,  and  C  contains  no  more  than  2>/2vn/(l  ->/2/3)  vertices. 

Proof.  Let  G  =  (V,E)  be  an  n-vertex  planar  graph.  We  shall  define  sequences  of 
sets  (A,),  (Bj),  (Ci),  (Di)  such  that: 

(i)  Ah  Blt  Ci,  Di  partition  V, 

(ii)  no  edge  joins  Ai  with  B„  A,  with  D„  or  B,  with  D„ 

(iii)  the  cost  of  A,  is  no  greater  than  the  cost  of  B,  and  the  cost  of  B,  is  no  greater 
than  the  cost  of  A,  U  C,  U  D„ 

(iv)  |Z>,|32|A-,!/3. 

Let  A0  —  B0  —  C0—  0,  Do  —  V.  Then  (i)-(iv)  hold.  If  A,_ ,,  C,_i,  Di- i  have 

been  defined  and  £>j_i  *  0,  let  G*  be  the  subgraph  of  G  induced  by  the  vertex  set 
Di-\.  Let  A*.  B*,  C*  be  a  vertex  partition  satisfying  Corollary  2  on  G*.  Without 
loss  of  generality,  suppose  A*  has  no  more  cost  than  B*.  Let  A,  be  the  set  among 
Aj-iUA*,  B,- 1  with  less  cost,  let  B,  be  the  set  among  A,_,  U A*,  fl,_,  with 
greater  cost,  let  C,  =  C,-i  U  C*,  and  let  D,  =  B*.  Then  (i),  (ii),  (iii),  and  (iv)  hold  for 
Ai,  Bit  Ch  Di. 

Let  k  be  the  largest  index  for  which  Ak,  Bk.  Ck,  Dk  are  defined.  Then  Dk  =  0 . 
Let  A  =  A*.  B  *  Bk,  C  =  Ck.  By  (i).  A,  B,  C  partition  V.  By  (ii),  no  edge  joins  a  vertex 
in  A  with  a  vertex  in  B.  By  (iii),  neither  A  nor  B  has  cost  exceeding  1/2.  By  (iv),  the 
total  number  of  vertices  in  C  is  bounded  by 

I  2v'2V^(2/3f2=  =  2V^L.  □ 

iZ0  1-V2/3 

Another  natural  question  is  whether  graphs  which  are  “almost"  planar  have  a 
Vn-separator  theorem.  The  finite  element  method  of  numerical  analysis  gives  rise  to 
one  interesting  class  of  almost-planar  graphs.  We  shall  extend  Theorem  4  to  apply  to 
such  graphs. 

A  finite  element  graph  is  any  graph  formed  from  a  planar  embedding  of  a  planar 
graph  by  adding  all  possible  diagonals  to  each  face.  (The  finite  element  graph  has  a 
clique  corresponding  to  each  face  of  the  embedded  planar  graph.)  The  embedded 
planar  graph  is  called  the  skeleton  of  the  finite  element  graph  and  each  of  its  faces  i.»  an 
element  of  the  finite  element  graph. 

Theorem  5.  Let  G  be  an  n-vertex  finite  element  graph  with  nonnegative  vertex 
costs  summing  to  no  more  than  one.  Suppose  no  element  of  G  has  more  than  k  boundary 
vertices.  Then  the  vertices  of  G  can  be  partitioned  into  three  sets  A,  B,  C  such  that  no 
edge  joins  a  vertex  in  A  with  a  vertex  inB,  neither  A  nor  B  has  total  cost  exceeding  2/3, 
and  C contains  no  more  than  4  [fc/2j  V n  vertices. 


I 


i 
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Proof.  Let  G*  be  the  skeleton  of  G.  Form  G**  from  G*  by  inserting  one  new 
vertex  into  each  face  of  G*  containing  four  or  more  vertices  and  connecting  the  new 
vertex  to  each  vertex  on  the  boundary  of  the  face.  Then  G**  is  planar.  Apply 
Theorem  4  to  G**.  Let  A **,  B**,  C**  be  the  resulting  vertex  partition.  This  partition 
satisfies  the  theorem  except  that  certain  edges  in  G  but  not  in  G**  may  join  A**  and 
B**.  These  edges  are  diagonals  of  certain  faces  of  G*;  call  these  bad  faces.  Each  bad 
face  must  contain  one  of  the  new  vertices  added  to  G*  to  form  G**,  and  this  vertex 
must  be  in  C**. 

Form  G  from  C**  by  deleting  alt  new  vertices  and  adding  to  G**.  for  each  bad 
face,  either  the  set  of  vertices  in  A **  on  the  boundary  of  the  bad  face,  or  the  set  of 
vertices  in  B**  on  the  boundary  of  the  bad  face,  whichever  is  smaller.  Let  A  be  the 
remaining  old  vertices  in  A**  and  let  B **  be  the  remaining  old  vertices  in  B**.  Then 
no  edge  in  G  joins  A  and  B ,  neither  A  nor  B  contains  more  than  2n/3  vertices,  and  C 
contains  no  more  than  2 %/2  [k/2  J  v'n  +  a  vertices,  where  a  is  the  number  of  faces  of  G* 
containing  four  or  more  vertices.  By  use  of  Euler's  theorem,  it  is  not  hard  to  show  that 
the  number  of  faces  of  G*  containing  four  or  more  vertices  is  at  most  n-2.  Thus 
|C|  £  lAr/2J  >fn,  and  the  theorem  is  true.  □ 

Corollary  4.  Let  G  be  any  n-vertex  finite  element  graph.  Suppose  no  element  of 
G  has  more  than  k  boundary  vertices.  The  vertices  of  G  can  be  partitioned  into  three  sets 
A,  B,  C  such  that  no  edge  joins  a  vertex  in  A  with  a  vertex  in  B,  neither  A  nor  B  contains 
more  than  2n/3  vertices  and  C  contains  no  more  than  4 1 Jfc/2  J  \fn  vertices. 

The  last  result  of  this  section  shows  that  Theorem  4  and  its  corollaries  are  tight  to 
within  a  constant  factor;  that  is,  if  /(n)  =  o(vn),  no  /(n)-separator  theorem  holds  for 
planar  graphs. 

Theorem  6.  For  any  kJetG  =  ( V,  E)  beak*  k  square  grid  graph  (a  k  x  fc  square 
section  of  the  infinite  grid  graph  in  Fig.  1).  Let  A  be  any  subset  of  V  such  that 
an  5 1 A |  £ n/2,  where  n  =  k2  and  a  is  a  positive  constant  less  than  1  /2.  The  the  number 
of  vertices  in  V-A  adjacent  to  some  vertex  in  A  is  at  least  k  ■  min  {1/2,  >/a}. 

Proof  Without  loss  of  generality,  suppose  that  the  number  r  of  rows  of  G  which 
contain  vertices  in  A  is  no  less  than  the  number  c  of  columns  of  G  which  contain 
vertices  in  A.  Then  an  £  |A|  £  rc  £  r1  and  r  a  \  ak. 

If  r*  is  the  number  of  rows  of  G  which  contain  only  vertices  in  A,  then  fcr*  £  |A|  £ 
n/2,  and  r*Sk/2.  Let  S  =  {xe  V:  x  is  adjacent  to  a  vertex  of  A}.  If  r*  =  0,  then 
|S|  &  r  2  >/o  k.  If  r*  *  0,  then  r  =  k  and  |S|  S  r  -  r*  =  -  r*  §  <c  |2 .  0 

It  is  an  open  problem  to  determine  the  smallest  constant  factor  which  can  replace 
2>/2  in  Theorem  4. 

3.  An  algorithm  for  finding  a  good  partition.  The  proof  of  Theorem  4  leads  to  an 
algorithm  for  finding  a  vertex  partition  satisfying  the  theorem.  To  make  this  algorithm 
efficient,  we  need  a  good  representation  of  a  planar  embedding  of  a  graph.  For  this 
purpose  we  use  a  list  structure  whose  elements  correspond  to  the  edges  of  the  graph. 
Stored  with  each  edge  are  its  endpoints  and  four  pointers,  designating  the  edges 
immediately  clockwise  and  counter-clockwise  around  each  of  the  endpoints  of  the 
edge.  Stored  with  each  vertex  is  some  incident  edge.  Figure  5  gives  an  example  of  such 
a  data  structure. 

Partitioning  Algorithm. 

Step  1.  Find  a  planar  embedding  of  G  and  construct  a  representation  for  it  of  the 
kind  described  above. 


Time:  0(n ),  using  the  algo  tofflOj. 
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Edges  and  neighbors 

c,  cc,  Cj  cc2 


Fig.  S.  Representation  of  an  embedded  planar  graph,  (c  =  clockwise,  cc  -  counter-clockwise.) 


Step  2.  Find  the  connected  components  of  G  and  determine  the  cost  of  each  one. 
If  none  has  cost  exceeding  2/3,  construct  the  partition  as  described  in  the  proof  of 
Theorem  4.  If  some  component  has  cost  exceeding  2/3,  go  to  Step  3. 

Time:  O(n)  (9], 

Step  3.  Find  a  breadth-first  spanning  tree  of  the  most  costly  component.  Compute 
the  level  of  each  vertex  and  the  number  of  vertices  L(l)  in  each  level  /. 

Time:  O(n). 

Step  4.  Find  the  level  fi.such  that  the  total  cost  of  levels  0  through  /» —  1  does  not 
exceed  1/2,  but  the  total  cost  of  levels  0  through  l,  does  exceed  1/2.  Let  k  be  the 
number  of  vertices  in  levels  0  through  l\. 

Time:  O(n). 

Step  5.  Find  the  highest  level  /0£/i  such  that  L(/0) +2(lt-  i0)  5  2-/k.  Fiwd  the 
lowest  level  /2S/i  +  l  such  that  L(/2)+2(f2-/i-l)s2'/n-fc. 

Time:  0{n). 

Step  6.  Delete  all  vertices  on  level  /2  and  above.  Construct  a  new  vertex  x  to 
represent  all  vertices  on  levels  0  through  h,.  Construct  a  Boolean  table  with  one  entry 
per  vertex.  Initialize  to  true  the  entry  for  each  vertex  on  levels  0  through  /n  and 
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(b)  0.4).  (2.6).  Or?),  (3.5), 


FlG.  6.  Shrinking  a  subtree  of  a  planar  graph. 

(a)  Original  graph.  Subtree  denoted  by  mdmmw  . 

(b)  Edges  scanned  around  subtree.  Those  forming  loops  and  multiple  edges  in  shrunken  graph 
an  crossed  out. 

(c)  Shrunken  graph.  Vertex  0  replaces  subtree. 

initialize  to  false  the  entry  for  each  vertex  on  levels  l0+ 1  through  /2  —  1 .  The  vertices 
on  levels  0  through  l0  correspond  to  a  subtree  of  the  breadth-first  spanning  tree 
generated  in  Step  3.  Scan  the  edges  incident  to  this  tree  clockwise  around  the  tree. 
When  scanning  an  edge  (v,  w)  with  v  in  the  tree,  check  the  table  entry  for  w.  If  it  is 
true,  delete  edge  (c,  w).  If  it  is  false,  change  it  to  true,  construct  an  edge  (x,  w),  and 
delete  edge  ( v ,  w).  The  result  of  this  step  is  a  planar  representation  of  the  shrunken 
graph  to  which  Lemma  2  is  to  be  applied.  See  Fig.  6. 

Time:  0(n). 

Step  7.  Construct  a  breadth-first  spanning  tree  rooted  at  x  in  the  new  graph.  (This 
can  be  done  be  modifying  the  breadth-first  spanning  tree  constructed  in  Step  3.) 
Record,  for  each  vertex  v,  the  parent  of  v  in  the  tree,  and  the  total  cost  of  all 
descendants  of  v  including  v  itself.  Make  all  faces  of  the  new  graph  into  triangles  by 
scanning  the  boundary  of  each  face  and  adding  (nontree)  edges  as  necessary. 


Step  8.  Choose  any  nontree  edge  (i>i,  wi).  Locate  the  corresponding  cycle  by 
following  parent  pointers  from  vx  and  w(.  Compute  the  cost  on  each  side  of  this  cycle 
by  scanning  the  tree  edges  incident  on  either  side  of  the  cycle  and  summing  their 
associated  costs.  If  ( v ,  w)  is  a  tree  edge  with  v  on  the  cycle  and  w  not  on  the  cycle,  the 
cost  associated  with  (v,  w)  is  the  descendant  cost  of  w  if  v  is  the  parent  of  w,  and  the 
cost  of  all  vertices  minus  the  descendant  cost  of  v  if  w  is  the  parent  of  v.  Determine 
which  side  of  the  cycle  has  greater  cost  and  call  it  the  ‘‘inside”.  See  Fig.  7. 
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PlC.  7.  Cycle  constructed  in  Step  8.  Alt  vertices  have  cost  .02.  Numbers  on  vertices  are  descendant  costs. 
The  total  cost  inside  the  cycle  is  .48.  outside  the  cycle  is  .34,  and  on  the  cycle  is  .18. 


Step  9.  Let  (u„  be  the  nontree  edge  whose  cycle  is  the  current  candidate  to 
complete  the  separator.  If  the  cost  inside  the  cycle  exceeds  2/3,  find  a  better  cycle  by 
the  following  method. 

Locate  the  triangle  (t)„  y,  w,)  which  has  (u„  *v,  )  as  a  boundary  edge  and  lies  inside 
the  (tii,  w,)  cycle.  If  either  (v„  y)or  (y,  w.)  is  a  tree  edge,  let  w(-,)  be  the  nontree 
edge  among  ( vit  y)  and  (y,  tv,).  Compute  the  cost  inside  the  (u1+1,  tv1+1)  cycle  from  the 
cost  inside  the  (u„  tv,)  cycle  and  the  cost  of  v„  y.  and  tv,.  See  Fig.  4. 

If  neither  («/„  y)  nor  (y,  tv,)  is  a  tree  edge,  determine  the  tree  path  from  y  to  the 
(«>i,  tv()  cycle  by  following  parent  pointers  from  y.  Let  z  be  the  vertex  on  the  (u„  tv,) 
cycle  reached  during  this  search.  Compute  the  total  cost  of  all  vertices  except  z  on  this 
tree  path.  Scan  the  tree  edges  inside  the  (y,  w>)  cycle,  alternately  scanning  an  edge  in 
one  cycle  and  an  edge  in  the  other  cycle.  Stop  scanning  when  all  edges  inside  one  of 
the  cycles  have  been  scanned.  Compute  the  cost  inside  this  cycle  by  summing  the 
associated  costs  of  all  scanned  edges.  Use  this  cost,  the  cost  inside  the  (v„  *v,)  cycle, 
and  the  cost  on  the  tree  path  from  y  to  z  to  compute  the  cost  inside  the  other  cycle. 
Let  w,»1)  be  the  edge  among  (vi,  y )  and  (y,  w, )  whose  cycle  has  more  cost  inside 
it. 

Repeat  Step  9  until  finding  a  cycle  whose  inside  has  cost  not  exceeding  2/3. 

77 me:  O(n)  (see  proof  below). 

Step  10.  Use  the  cycle  found  in  Step  9  and  the  levels  found  in  Step  4  to  construct  a 
satisfactory  vertex  partition  as  described  in  the  proof  of  Lemma  3.  Extend  this 
partition  from  the  connected  component  chosen  in  Step  2  to  the  entire  graph  as 
described  in  the  proof  of  Theorem  4. 

Time-.  O(n). 

This  completes  our  presentation  of  the  algorithm.  All  steps  except  Step  9 
obviously  run  in  O(n)  time.  We  urge  readers  to  fill  in  the  details  of  this  algorithm;  we 
content  ourselves  here  with  proving  that  Step  9  requires  O(n)  time. 

Proof  of  Step  9  time  bound.  Each  iteration  of  Step  9  deletes  at  least  one  face  from 
the  inside  of  the  current  cycle.  Thus  Step  9  terminates  after  O(n)  iterations.  The  total 
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running  time  of  one  iteration  of  Step  9  is  0(1)  plus  time  proportional  to  the  length  of 
the  tree  path  from  y  to  z  plus  time  proportional  to  the  number  of  edges  scanned  inside 
the  (t»(,  y )  and  (y,  w()  cycles.  Each  vertex  on  the  tree  path  from  y  to  z  (except  z)  is 
inside  the  current  cycle  but  on  the  boundary  or  outside  of  all  subsequent  cycles.  For 
every  two  edges  scanned  during  an  iteration  of  Step  9,  at  least  one  edge  is  inside  the 
current  cycle  but  outside  all  subsequent  cycles.  It  follows  that  the  total  time  spent 
traversing  tree  paths  and  scanning  edges,  during  all  iterations  of  Step  9,  is  O(n).  Thus 
the  total  time  spent  in  Step  9  is  0(n ).  □ 

By  making  minor  modifications  to  this  algorithm,  one  can  construct  an  Ofn )  time 
algorithm  to  find  a  vertex  partition  satisfying  Theorem  5,  and  0(n )  time  algorithms  to 
find  vertex  partitions  satisfying  Corollary  2  and  Corollary  4. 

4.  Applications.  The  separator  theorem  proved  in  §  2  allows  us  to  obtain  many 
new  complexity  results  since  it  opens  the  way  for  efficient  application  of  divide-and- 
conquer  on  planar  graphs.  We  mention  a  few  such  applications  here;  we  shall  present 
the  details  in  a  subsequent  paper. 

Generalized  nested  dissection.  Any  system  of  linear  equations  whose  sparsity 
structure  corresponds  to  a  planar  or  finite  element  graph  can  be  solved  in  0(n3/~)  time 
and  0(n  log  n)  space.  This  result  generalizes  the  nested  dissection  method  of  George 
15]. 

Pebbling.  Any  n -vertex  planar  acyclic  directed  graph  with  maximum  in-degree  k 
can  be  pebbled  using  0(Vn  +  fc  logn)  pebbles.  See  [8],  [16]  for  a  description  of  the 
pebble  game. 

The  post  office  problem.  Knuth’s  “post  office”  problem  [11]  can  be  solved  in 
0(Oog  fi)r)  time  and  Ofn)  space.  See  [3],  [17]  for  previous  results. 

Data  structure  embedding  problems  Any  planar  data  structure  can  be  efficiently 
embedded  into  a  balanced  binary  tree.  See  [2],  [14]  for  a  description  of  the  problem 
and  some  related  results. 

Lower  bounds  on  Boolean  circuits.  Any  planar  circuit  for  computing  Boolean 
convolution  contains  at  least  cn 2  gates  for  some  positive  constant  c. 

Appendix.  Graph-theoretic  definitions.  A  graph  G  =  ( V,  E)  consists  of  a  set  V  of 
vertices  and  a  set  E  of  edges.  Each  edge  is  an  unordered  pair  (t>,  w)  of  distinct  vertices. 
If  (v,  w)  is  an  edge,  v  and  w  are  adjacent  and  (u,  w)  is  incident  to  both  v  and  w.  A  path 
of  length  k  with  endpoints  v,  w  is  a  sequence  of  vertices  v  -  Vo,  t>i,  t>2,  •  •  • ,  t>i  *  w  such 
that  (v,-i,  is  an  edge  for  1  S  i  S  k.  If  all  the  vertices  v0,  V\,  •  •  • ,  vk-i  are  distinct,  the 
path  is  simple.  If  v  -  w,  the  path  is  a  cycle.  The  distance  from  v  to  w  is  the  length  of  the 
shortest  path  from  v  to  w.  (The  distance  is  infinite  if  v  and  w  are  not  joined  by  a  path.) 
The  level  of  a  vertex  v  in  a  graph  G  with  respect  to  a  fixed  root  r  is  the  distance  from  r 
to  v. 

If  G i  =  ( Vi,  Ei)  and  G2  =  ( V2,  E2)  are  graphs,  G\  is  a  subgraph  of  G2  if  V,  s  V2 
and  £|  S  E2.  G i  is  a  generalized  subgraph  of  G2  if  V,  c  V2  and  there  is  a  mapping  f 
from  E\  into  the  set  of  paths  of  02  such  that,  for  each  edge  (v,  tv)e  £,,  f((v.  w))  has 
endpoints  v  and  w,  and  no  two  paths  f((v i,  h>i))  and  /((i>2,  w2))  share  a  vertex  except 
possibly  an  endpoint  of  both  paths.  IfG  =  (V|,£t)isa  graph  and  V|  s  V2,  the  graph 
G\  =  (V|,  £t)  where  E\  =  f2n[(t\  w)|i\  we  V,}  is  the  subgraph  of  G2  induced  by  the 
vertex  set  V,.  If  G|  =  ( Vt,  E\)  is  a  subgraph  of  G2  =  ( V2,  £2),  then  shrinking  G\  to  a 
single  vertex  in  G2  means  forming  a  new  graph  G2  from  G2  by  deleting  from  G2  all 
vertices  in  V,  and  all  their  incident  edges,  adding  a  new  vertex  x  to  G2,  and  adding  a 
new  edge  (x.  w)  to  G2  for  each  edge  (v,  w)e  £2  such  that  ve  V,  and  wt i  W 

A  graph  is  connected  if  any  two  vertices  in  it  are  joined  by  a  path.  The  connected 
components  of  a  graph  are  its  maximal  connected  subgraphs.  A  clique  is  a  graph  such 
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that  any  two  vetices  are  joined  by  an  edge.  A  tree  is  a  connected  graph  containing  no 
cycles.  We  shall  generally  assume  that  a  tree  has  a  distinguished  vertex,  called  a  root.  If 
f  is  a  tree  with  root  r  and  v  is  on  the  (unique)  simple  path  from  r  to  tv,  v  is  an  ancestor 
or  w  and  tv  is  a  descendant  of  v.  If  in  addition  (t>,  tv)  is  an  edge  of  T,  then  v  is  the 
parent  of  w  and  tv  is  a  child  of  v.  The  radius  of  a  tree  is  the  maximum  distance  of  any 
vertex  from  the  root.  A  spanning  tree  Tola  graph  G  is  a  subgraph  of  G  which  is  a  tree 
and  which  contains  all  the  vertices  of  G.T  is  a  breadth-first  spanning  tree  with  respect 
to  a  root  r  if,  for  any  vertex  v,  the  distance  from  r  to  v  in  T  is  equal  to  the  distance 
from  r  to  v  in  G. 

A  graph  G  =  ( V,  E)  is  planar  if  there  is  a  one-to-one  map  f\  from  v  into  points  in 
the  plane  and  a  map  /2  from  E  into  simple  curves  in  the  plane  such  that,  for  each  edge 
(t>,  iv)€£,  w))  has  endpoints  f{(v)  and  f2(w),  and  no  two  curves  f2((v\,  *v,)), 

f2((v 2,  *v2))  share  a  point  except  possibly  a  common  endpoint.  Such  a  pair  of  maps  ft, 
f2  is  a  planar  embedding  of  G.  The  connected  planar  regions  formed  when  the  ranges 
of  /i  and  f2  are  deleted  from  the  plane  are  called  the  faces  of  the  embedding.  Each 
face  is  bounded  by  a  curve  corresponding  to  a  cycle  of  G ,  called  the  boundary  of  the 
face.  We  shall  sometimes  not  distinguish  between  a  face  and  its  boundary.  A  diagonal 
of  a  face  is  an  edge  (v,  w)  such  that  t>  and  tv  are  nonadjacent  vertices  on  the  boundary 
of  the  face. 
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Abstract. 

Any  n-vertex  planar  graph  has  the  property  that 
it  can  be  divided  into  components  of  roughly  equal  size 
by  removing  only  0( 'In  )  vertices.  This  separator 
theorem,  in  combination  with  a  divide-and-conquer 
strategy,  leads  to  many  new  complexity  results  for 
planar  graph  problems.  This  paper  describes  seme  of 
these  results. 
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1.  Introduction. 

One  efficient  approach  to  solving  computational 
problems  is  "divide-and-conquer"  [1].  In  this  method, 
the  original  problem  is  divided  into  two  or  more 
smaller  problems.  The  subproblems  are  solved  by 
applying  the  method  recursively,  and  the  solutions  to 
the  subproblems  are  combined  to  give  the  solution  to 
the  original  problem.  Divide-and-conquer  is  especially 
efficient  when  the  subproblems  are  substantially 
smaller  than  the  original  problem. 

In  [lL]  the  following  theorem  is  proved. 

Theorem  1.  Let  G  be  any  n-vertex  planar  graph  with 
non -negative  vertex  costs  sunning  to  no  more  than  one. 
Then  the  vertices  of  G  can  be  partitioned  into  three 
sets  A ,  B ,  C  ,  such  that  no  edge  Joins  a  vertex  in  A 
with  a  vertex  in  B  ,  neither  A  nor  B  has  total 
vertex  .cost  exceeding  2/3  ,  and  C  contains  no  more 
than  2v2vn  vertices.  Furthermore  A,  B,C  can  be 
found  in  0(n)  time. 

In  the  special  case  of  equal-cost  vertices,  this 
theorem  becomes 

Corollary  1.  Let  G  be  any  n-vertex  planar  graph. 

The  vertices  of  G  can  be  partitioned  into  three  sets 
A ,  B ,  C  ,  such  that  no  edge  Joins  a  vertex  in  A  with 
a  vertex  in  B  ,  neither  A  nor  B  contains  more 
thap  2n/3  vertices,  and  C  contains  no  more  than 
2V2vn  vertices. 

Theorem  1  and  its  corollary  open  the  way  for 
efficient  application  of  divide-and-conquer  to  a 
variety  of  problems  on  planar  graphs.  In  this  paper 
we  explore  a  number  of  such  applications.  Each 
section  of  the  paper  describes  a  different  use  of 
divide-and-conquer.  The  results  range  from  an 


efficient  approximation  algorithm  for  finding  maximum 
independent  sets  in  planar  graphs  to  lower  bounds  on 
the  complexity  of  planar  Boolean  circuits.  The  last 
section  mentions  two  additional  applications  whose 
description  is  too  lengthy  to  be  included  in  this 
paper. 

2.  Approximation  Algorithms  for  NP-Complete  Problems. 

Divide-and-conquer  in  combination  with  Theorem  1 
can  be  used  to  rapidly  find  good  approximate  solutions 
to  certain  NP-ccnplete  problems  on  planar  graphs.  As 
an  example  we  consider  the  maximum  independent  set 
problem,  which  asks  for  a  maximum  number  of  pairwise 
non-adjacent  vertices  in  a  planar  graph. 

Theorem  2.  Let  G  be  an  n-vertex  planar  graph  with 
non-negative  vertex  costs  summing  to  no  more  than  one 
and  let  0  <  e  <  1  .  Then  there  is  some  set  C  of 
0(Vn/e )  vertices  whose  removal  leaves  G  with  no 
connected  component  of  cost  exceeding  e  .  Further¬ 
more  the  set  C  can  be  found  in  0(n  log  n)  time. 

Proof.  Apply  the  following  algorithm  to  G  . 

Initialization:  Let  C  «  p  . 

General  Step:  Find  seme  connected  component  K  in 

G  minus  C  with  cost  exceeding  e  .  Apply 

Corollary  1  to  K  ,  producing  a  partition 

A1  ’  ®1  '  C1  of  its  vertices.  Let  C  «  C  u  . 

If  one  of  A^  and  B^  (say  )  has  cost 

exceeding  two-thirds  the  cost  of  K  ,  apply 

Theorem  1  to  the  subgraph  of  G  induced  by  the 

vertex  set  A^  ,  producing  a  partition  Ag  ,  Bg  , 

C2  of  Ax  .  Let  C  =  CUC-,  . 

Repeat  the  general  step  until  G  minus  C  has 
no  component  with  cost  exceeding  e  . 

The  effect  of  one  execution  of  the  general  step 
is  to  divide  the  component  K  into  smaller  components, 
each  with  no  more  than  two-thirds  the  cost  of  K  and 
each  with  no  more  than  two-thirds  as  many  vertices 
as  K  .  Consider  all  components  which  arise  during 
the  course  of  the  algorithm.  Assign  a  level  to  each 
canqjonent  as  follows.  If  the  component  exists  when 
the  algorithm  halts,  the  component  has  level  zero. 
Otherwise  the  level  of  the  component  is  one  greater 
than  the  maximum  level  of  the  components  formed  when 
it  is  split  by  the  general  step.  With  this  definition, 
any  two  components  on  the  same  level  are  vertex- 
disjoint. 

Each  level  one  component  has  cost  greater  than  e , 
since  it  is  eventually  split  by  the  general  step.  It 
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follows  that,  for  1  >  1  ,  each  level  i  component 
has  cost  at  least  (3/2  J1'1*  and  contains  at  least 

(3/2 1*  vertices.  Since  the  total  cost  of  G  is  at 
most  one,  the  total  number  of  exponents  of  level  i 

Is  at  most  (2/3  J1*1/  £  . 

The  total  running  tine  of  the  algorithm  is 
0(£  (|K|  I  I  Is  a  component  split  by  the  general 
step})  .  Since  a  component  of  level  i  contains  at 
least  (3/2 )*  vertices,  the  maximum  level  k  must 
satisfy  (3/2 )k  <  n  ,  or  k  <  l0Sj/2  n  • 

components  in  each  level  are  vertex-disjoint,  the 
total  running  time  of  the  algorithm  is 
0(n  1085/2  "  °^D  108  * 

The  total  size  of  the  set  C  produced  by  the 
algorithm  is  bounded  by 


0(£  [f\K\  I  K  is  a  ccmponent  split  by  the  general 
step}) 


-  O^Vn/f  £  (2/5  ^  ■  0(</n/c  )  .  O 


The  following  algorithm  uses  Theorem  2  to  find  on 
approximately  maximum  independent  set  I  in  a  planar 
graph  G  »  (V,E)  . 

8tep  1.  Apply  Theoros  2  to  G  with 

c  ■  (log  log  n)/n  and  each  vertex  having 
cost  1/n  to  find  a  set  of  vertices  C 

containing  0(n/ i/log  log  n  )  vertices  whose 
ronoval  leaves  no  connected  cceiponent  with 
more  than  log  log  n  vertices. 

Step  g.  In  each  connected  component  of  G  minus  C  , 
find  a  maximum  independent  set  by  checking 
every  subset  of  vertices  for  independence, 
for*  I  as  a  union  of  maximum  independent 
sets,  one  from  each  component. 


Let  I*  be  a  maximum  independent  set  of  G  .  The 
restriction  of  I*  to  one  of  the  connected  components 
formed  when  C  Is  removed  from  G  can  be  no  larger 
than  the  restriction  of  I  to  the  same  component. 

Thus  |I*|  -  |I|  -  O(n/Vlog  log  n  )  .  Since  G  is 
planar,  0  Is  four-colorable,  and  |l»]  >  n/4  , 

Thus  (|I*|-|I|)/ |I*|  -  O(l/Vlog  log  n  )  ,  and  the 

relative  error  In  the  size  of  I  tends  to  zero  with 
Increasing  n  . 

Step  1  of  the  algorithm  requires  0(n  log  n) 

n, 

time  by  Tbeora  2.  Step  2  requires  0(n1  2  )  tine 

cm  a  connected  component  of  n^  vertices.  The  total 


time  required  by  Step  2  is  thus 


f  i  n  n,  n 

0  max/  £  n.  2  |  £  n. 

V  Vi-1  i.l  1 


n  and 


}) 


0  <  Sj  <  log  log  n 

0  ^ log  log  n  lQ«  ")gl08l0gn)  -  0(n  log 

Hence  the  entire  algorithm  requires  0(n  log  n) 


n)  . 
time. 


3.  Nonserial  Dynamic  Programing. 


Many  NF-canplete  problems,  such  as  the  maxisnsn 
independent  set  problem,  the  graph  coloring  problem, 
and  others,  can  be  formulated  as  non-serial  dynamic 
programming  problems  [2,20].  Such  a  problem  is  of 
the  fallowing  form:  minimize  the  objective  function 
f(x^,...,xn)  ,  where  f  is  given  as  a  sum  of  terms 

fk(>)  ,  each  of  which  is  a  function  of  only  a  subset 

of  the  varieties.  He  shall  as  mine  that  all  variables 
x^  take  on  values  frcm  the  same  finite  set  S  ,  and 

that  the  values  of  the  terms  fk(- )  are  given  by 


tables.  Associated  with  such  an  objective  function  f 
is  an  interaction  graph  G  -  (V, E)  ,  containing  one 
vertex  v,  for  each  variable  Xj  in  f  ,  and  an 


edge  Joining  xA  and  x^  for  any  two  variables 
and  Xj  which  appear  in  a  conmon  term  fk( • )  . 

By  trying  ail  possible  values  of  the  variables, 
a  nonserial  dynamic  programming  problem  can  be  solved 


in  time.  We  shall  show  that  if  the  inter¬ 

action  graph  of  the  problem  is  planar,  the  problem 

can  be  solved  in  2°  n  ^  time.  This  means  that 
substantial  savings  are  possible  when  solving  typical 
NP-ccmplete  problems  restricted  to  planar  graphs. 

Note  that  if  the  interaction  graph  of  f  is  planar, 
no  term  fk(.)  of  f  can  contain  more  than  four 


variables,  since  the  canplete  graph  on  five  vertices 
is  not  planar. 


In  order  to  describe  the  algorithm,  we  need  one 
additional  concept.  The  restriction  of  an  objective 

m 

function  f  -  £  f.  to  a  set  of  variables 

k-  1 

x,  ,...,x  is  the  objective  function 

ll  1J 

f  •  £{fk  |  fk  depends  upon  one  or  more  of  xi  ,...,xi  }. 


Given  an  objective  function  f(x^, ...,xn)  - 
m 

£  f.  and  a  subset  S  of  the  variables  x.,...,x 
k.l  K  in 

which  are  constrained  to  have  specific  values,  the 
following  algorithm  solves  the  problem:  maximize  f 
subject  to  the  constraints  on  the  variables  in  S  . 

In  the  presentation,  we  do  not  distinguish  between 
the  variables  Xj, •••,xh  and  the  corresponding 

vertices  in  the  interaction  graph. 

Step  1.  If  n  <  9  ,  solve  the  problem  by  exhaustively 
trylng~all  possible  assignments  to  the 
unconstrained  variables.  Otherwise,  go  to 
Step  2. 


1 


76 


•  ;  ''US  OF  PAGES  '//iii  JH  00  WI 

— —  J  UL'/. 


Step  2.  Apply  Corollary  1  to  the  Interaction  graph 
0  of  t  .  Let  A  ,  B ,  C  he  the  resulting 
vertex  partition.  Let  be  the  restric¬ 
tion  of  f  to  AUC  and  let  f2  be  the 

restriction  of  f  to  BuC  .  For  each 
possible  assignment  of  values  to  the 
variables  in  C-E  ,  perform  the  following 
steps: 

(a)  Maximize  f^  with  the  given  values  for 

the  variables  in  C  u  S  by  applying  the 
stethod  recursively; 

(b)  Maximize  f£  with  the  given  values  for 

the  variables  in  C  u  S  by  applying  the 
method  recursively; 

(c)  Combine  the  solutions  to  (a)  and  (b)  to 
obtain  a  maxi  mm  value  of  f  with  the 
given  values  for  the  variables  in  C  u  S  • 

Choose  the  assignment  of  values  to  variables 
in  C-S  which  maximizes  f  and  return  the 
appropriate  value  of  f  as  the  solution. 

The  correctness  of  this  algorithm  is  obvious.  If 

n  >  9  ,  the  algorithm  solves  at  most  2°^''°  )  sub- 
problems  in  Step  2,  since  C  Is  of  0 (Vn  )  size. 

Each  subproblem  contains  at  most 

2n/5  *  2v2vn  <  29n/30  variables.  Thus  if  t(n)  is 
the  running  time  of  the  algorithm,  we  have 

t(n)  <  0(n  log  n)*2°<~‘^)  •  t(29n/30)  if  n>9, 
t(n)  «=  0(1)  if  n  <  9  .  An  inductive  proof  shows 

that  t(n)  <  2°^  ^  . 


fc.  Pebbling. 

The  following  one-person  game  arises  in  register 
allocation  problems  [21],  the  conversion  of  recursion 
to  iteration  [16],  and  the  study  of  time-space  trade¬ 
offs  [U,l0,l8].  Let  G  m  (V, E)  be  a  directed  acyclic 
graph  with  maximum  in-degree  k  .  If  (v,w)  is  an 
edge  of  G  ,  v  is  a  predecessor  of  w  and  w  is  a 
successor  of  v  .  The  game  involves  placing  pebbles 
on  the  vertices  of  G  according  to  certain  rules. 

A  given  step  of  the  game  consists  of  either  placing  a 
pebble  on  an  empty  vertex  of  G  (called  pebbling  the 
vertex)  or  rmnoving  a  pebble  fran  a  previously  pebbled 
vertex.  A  vertex  may  be  pebbled  only  if  all  its 
predecessors  have  pebbles.  The  object  of  the  game  is 
to  successively  pebble  each  vertex  of  G  (in  any 
order)  subject  to  the  constraint  that  at  most  a  given 
nuaber  of  pebbles  are  ever  on  the  graph  simultaneously. 

It  is  easy  to  pebble  any  vertex  of  an  n-vertex 
graph  in  n  steps  using  n  pebbles.  We  are 
Interested  in  pebbling  methods  which  use  fewer  than  n 
pebbles  but  possibly  many  more  than  n  steps.  It  is 
known  that  any  vertex  of  an  n-vertex  graph  can  be 
pebbled  with  0(n/log  n)  pebbles  [10]  (where  the 
constant  depends  upon  the  maxlmvas  in-degree),  end  that 
in  general  no  better  bound  is  possible  [18 ] .  We  shall 
show  that  if  the  graph  is  planar,  only  0(vn  )  pebbles 
are  necessary,  generalizing  a  result  of  [18] .  An 
example  of  Cook  [U]  shows  that  no  better  bound  is 
possible  for  planar  graphs. 

Theorem  3.  Any  n-vertex  planar  acyclic  directed 
graph  with  maxi  sain  in- degree  k  can  be  pebbled  using 
0(vn  ♦  k  logg  n)  pebbles. 

Proof.  Let  a  ■  2 Vs  and  3  ■  2/3  ,  Let  G  be  the 
graph  to  be  pebbled.  Use  the  following  recursive 


pebbling  procedure.  If  n  <  uq.  ,  where 
nQ  -  (Q/(l-e))2  ,  pebble  all  vertices  of  G  without 
deleting  pebbles.  If  n  >  nQ  ,  find  a  vertex 
partition  A,  B,  C  satisfying  Corollary  1.  Pebble 

the  vertices  of  G  in  topological  order. ^  To  pebble 
a  vertex  v  ,  delete  all  pebbles  except  those  an  C  . 
For  each  predecessor  u  of  v  ,  let  G(u)  be  the 
subgraph  of  G  Induced  by  the  set  of  vertices  with 
pebble-free  paths  to  u  .  Apply  the  method  recursively 
to  each  G(u)  to  pebble  all  predecessors  of  v  , 
leaving  a  pebble  on  each  such  predecessor.  Then 
pebble  v  . 

If  p(n)  is  the  maxlmvas  number  of  pebbles 
required  by  this  method  an  any  n-vertex  graph,  then 

p(n)  -  n  if  n  <  Uq  , 

p(n)  <  a-/n  +  k  +  p(2n/3  +  aVn  )  if  n  >  nQ  . 

An  inductive  proof  shows  that  p(n)  is 
0(Vn  +  k  logg  n)  .  □ 

It  is  also  possible  to  obtain  a  substantial 
reduction  in  pebbles  while  preserving  a  polynomial 
bound  on  the  number  of  pebbling  steps,  as  the 
following  theorem  shows. 

Theorem  U.  Any  n-vertex  planar  acyclic  directed 
graph  with  maximum  in-degree  k  can  be  pebbled  using 

0(n  ^  *■  k)  pebbles  in  time. 

proof.  Let  C  be  a  set  of  0(n2^)  vertices  whose 
removal  leaves  G  with  no  weakly  connected 

component^  containing  more  than  n2^  vertices. 

Such  a  set  C  exists  by  Theorem  2.  The  following 
pebbling  procedure  places  pebbles  permanently  on  the 
vertices  of  C  .  Pebble  the  vertices  of  G  in 
topological  order.  To  pebble  a  vertex  v  ,  pebble 
each  predecessor  u  of  v  and  then  pebble  v  ,  To 
pebble  a  predecessor  u  ,  delete  all  pebbles  from  G 
except  those  on  vertices  in  C  or  on  predecessors 
of  v  .  Find  the  weakly  connected  component  in  G 
minus  C  containing  u  .  Pebble  all  vertices  in 
this  ccinponent,  in  topological  order. 

The  total  number  of  pebbles  required  by  this 
strategy  is  0(n2^)  to  pebble  vertices  in  C  plus 

n2/^  to  pebble  each  weakly  connected  component  plus 
k  to  pebble  predecessors  of  the  vertex  v  to  be 
pebbled.  Hie  total  number  of  pebbling  steps  is  at 

most  Ojk-n-n2^)  »  0(kn^^)  .  Q 


5,  Lower  Bounds  on  Boolean  Circuit  Size. 

A  Boolean  circuit  is  an  acyclic  directed  graph 
such  that  each  vertex  has  in-degree  zero  or  two,  the 
predecessors  of  each  vertex  are  ordered,  and 
corresponding  to  each  vertex  v  of  in-degree  two  is 
a  binary  Boolean  operation  by  .  With  each  vertex  of 

the  circuit  we  associate  a  Boolean  function  ifelch  the 
vertex  caqmtes,  defined  as  follows.  With  each  of 
the  k  vertices  vA  of  ln-degree  zero  (inputs) 

That  it,  in  an  order  such  that  if v is  a 
predecessor  of  w  ,  v  is  pebbled  before  w  . 

2/  A  weakly  connected  component  of  a  directed  graph 
is  a  connected  component  oT” the  undirected  graph 
foraed  by  ignoring  edge  directions. 
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we  associate  a  variable  Xi  and  an  Identity  function 
t  (x. )  ■  x,  .  with  each  vertex  w  of  in-degree  two 

Vj  i  1 

having  predecessors  u,  v  ve  associate  the  function 
f  ■  by(fu,  fy)  .  The  circuit  computes  the  set  of 

functions  associated  vlth  its  vertices  of  out-degree 
zero  (outputs). 

He  are  Interested  in  obtaining  lover  hounds  on 
the  size  (nMober  of  vertices)  of  Boolean  circuits 
uhich  caspute  certain  ccmmon  and  important  functions. 
Using  Theorm  1  we  can  obtain  such  lover  bounds 
under  the  asstnption  that  the  circuits  are  planar. 

Any  circuit  can  be  converted  into  a  planar  circuit  by 
the  following  steps.  First,  oabed  the  circuit  in  the 
plane,  allowing  edges  to  cross  if  necessary.  Next, 
replace  each  pair  of  crossing  edges  by  the  crossover 
circuit  illustrated  in  Figure  1.  It  follows  that  any 
lower  bound  on  the  size  of  planar  circuits  is  also  a 
lower  bound  on  the  total  number  of  vertices  and  edge 
crossings  in  any  planar  representation  of  a  n on-planar 
circuit.  In  a  technology  for  which  the  total  number 
of  vertices  and  edge  crossings  Is  a  reasonable  measure 
of  cost,  our  lower  bounds  imply  that  it  may  be 
expensive  to  realize  certain  coononly  used  functions 
in  hardware. 

A  superconcentrator  is  an  acyclic  directed  graph 
with  m  inputs  and  m  outputs  such  that  any  set  of 
k  inputs  and  any  set  of  k  outputs  are  joined  by  k 
vertex-disjoint  paths,  for  all  k  in  the  range 
1  <  k  <  m  . 

Theorem  5.  Any  m- Input,  m-output  planar  supercon¬ 
centrator  contains  at  least  m2/72  vertices. 

Proof.  Let  Q  be  an  m-input,  m-output  planar 
superconcentrator.  Assign  to  each  input  and  output 
of  G  a  cost  of  l/(2m)  ,  and  to  every  other  vertex 
a  cost  of  zero.  Let  A ,  B ,  C  be  a  vertex  partition 
satisfying  Theorem  1  an  G  (ignoring  edge  directions). 
Suppose  C  contains  p  Inputs  and  outputs.  Without 
loss  of  generality,  suppose  that  A  is  no  more  costly 
than  B  ,  and  that  A  contains  no  more  outputs  than 
inputs.  A  contains  between  2m/J  -  p  and  m  -  p/2 
Inputs  and  outputs.  Hence  A  contains  at  least 
m/3  -  p/2  inputs  and  at  most  m/2  -  p/4  outputs. 

B  contains  at  least  m-p-  (m/2  -  p/4)  •  m/2  -  3p/4 
outputs.  Let  k  m  min{rm/3  -  p/21 ,  [ m/2  -  3p/41  }  . 
Since  G  is  a  superconcentrator,  any  set  of  k 
inputs  in  A  and  any  set  of  k  outputs  in  B  are 
joined  by  k  vertex-disjoint  paths.  Each  such  path 
must  contain  a  vertex  in  C  which  is  neither  an 
input  nor  an  output.  Thus 

2 V2vir  -  p  >  mln(m/3  -  p/2 ,  m/2  -  3p/4}  >  m/3  -  p  , 
and  n  >  m2/72  .  D 

The  property  of  being  a  superconcentrator  is  a 
little  too  strong  to  be  useful  in  deriving  lover 
botmdg  on  the  casplexity  of  Interesting  functions. 
Htnrever,  there  are  weaker  properties  which  still 

o 

require  n(m  )  vertices.  Let  G  «  (V,E)  be  an 
acyclic  directed  graph  with  m  numbered  inputs 
vl’v2’*-,'vm  *nd  B  n™tered  outputs  WyWg,  ...,wB  . 

G  is  said  to  have  the  shifting  property  if,  for  any 
k  in  the  range  1  <  k  <  m  ,  any  t  In  the  range 
0  <  I  <  m-k  ,  and  ary  subset  of  k  sources 
Jwj  ,.7.,v^}  such  that  ij, igl . .., i^  <  m-/  ,  there 

are  k  vertex-disjoint  paths  joining  the  set  of 
inputs  fv,  ,...,v.  )  with  the  set  of  outputs 
ll  *k 
{Vii+#,...,Vik+/)  . 


Theorem  6.  Let  G  be  a  planar  acyclic  directed 

graph  with  the  shifting  property.  Then  G  contains 
o 

at  least  (_m/2 j  /162  vertices. 

Proof.  Suppose  that  G  contains  n  vertices. 

Assign  a  cost  of  1/m  to  each  of  the  first  |_m/2j 
inputs  and  to  each  of  the  last  |_m/2j  outputs 
of  G  ,  and  a  cost  of  zero  to  every  other  vertex 
of  G  .  Call  the  first  |_m/2  j  inputs  and  the  last 
(_m/2j  outputs  of  G  costly.  Let  A,  B,  C  be  a 
vertex  partition  satisfying  Theorem  1  on  G 
(ignoring  edge  directions). 

Without  lose  of  generality,  suppose  that  A  Is 
no  more  costly  than  B  ,  and  that  A  contains  no 
more  costly  outputs  than  coetly  inputs.  Let  A'  be 
the  eet  of  coetly  Inputs  in  A  ,  B'  the  set  of 
costly  outputs  in  B  ,  p  the  number  of  costly 
Inputs  end  outputs  in  C  ,  and  q  the  number  of 
costly  inputs  and  outputs  in  A  .  Then 
2|_m/2j/3  -  P  <  <1  <  |_m/2j  -  p/2  .  Hence 

|A' |  >  q/2  >  L“/2j/3  -  p/2  .  Also 
|A’  MB' |  £  |A'|.(|m/2j  -  p  -  (q-  |A'|)) 

£  <a/2  •  (lV2j  -  p  -  9/2) 

£  ( L®/2j/3  -  p/2)(l»/2J  -  P  - 

Lm/2j/3  +  p/2) 

-  (L»/Sj/3  -  p/2)  (2 L®/2 J  /3  -  p/2) 

£  2L“/2j2/q  -  PL“/2j/2  • 

For  v.(A'  ,  Vj  c  B’  ,  and  |  in  the  range 

1  <  l  <  |_m/2 j  ,  call  vi  >  wi  >  *  a  m»*ch  if 

j-i  rn  l  .  For  every  Vj c A’  end  Wj  e  B'  there  is 

exactly  one  value  of  I  which  produces  a  match;  hence 

the  total  number  of  matches  for  all  possible  v^  ,  w^  , 

I  is  |A'|*|B'|  >  2|_m/2j2/q  -  p|_m/2j/2  .  Since 

there  are  only  |_m/2j  values  of  l  ,  some  value  of 

/  produces  at  least  2l_m/2j/q  -  p/2  matches.  Thus, 

for  k  •  2|_m/2j/q  -  p/2  ,  there  is  some  value  of  I 

add  some  set  of  k  inputs  A"  »  fv,  , V,  , . . . , v,  ]  c 

■*1  2  k  - 

A'  euch  that  B"  +1»  •••»vi)  +  |)  £  B'  . 

Since  G  has  the  shifting  property,  there  must  be  k 
vertex-disjoint  paths  between  A"  and  B"  .  But  each 
such  path  must  contain  a  vertex  of  C  which  is 
neither  an  input  nor  an  output.  Hence 

2'Jz,Jn  -  p  >  2(.m/2j/q  -  p/2  ,  and 
n  >  Lm/2j2/l62  .  Q 

A  shifting  circuit  is  a  Boolean  circuit  with  m 
primary  Inputs  x^,  x^, . . . ,  xB  ,  zero  or  more  auxiliary 

inputs,  and  m  outputs  z^, tg, . . . ,  zB  ,  such  that,  for 

any  k  in  the  range  0  <  k  <  m  ,  there  is  same 
assignment  of  the  constants  ”0,1  to  the  auxiliary 
inputs  so  that  output  z^+^  c (Disputes  the  identity 

function  Xj  ,  for  0  <  i  <  m-k  .  The  Boolean 
convolution  of  two  Boolean  vectors  (x^,  xg>  •  •  •  > 

«nd  (yj#yg.  •••»y>)  is  the  vector  (z2,  z^, .  ...z^) 

given  by  t.  »  £  x.y,  . 

i+J-k  J 


! 
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Corollary  2.  Any  planar  shifting  circuit  has  at 
least  Lm/2j2/l62  vertices. 

Proof.  Any  shifting  circuit  has  the  shifting 
property.  See  [2}, 21).  □ 

Corollary  5.  Any  planar  circuit  for  ccoputing 

2 

Boolean  convolution  has  at  least  ^ra/2j  /162 
vertices. 


1/4  -  c^s/n  / ( Uje^  >  to  the  cost  of  B  .  Ttaus  inputs 

contribute  at  most  1/1*  -  c.y[n/(k^F)  to  the  cost 
1  2  r— 

of  B  ,  and  B  contains  at  most  m  ♦  c^vn  inputs. 

A  contains  at  least  2td2  -  (m?  *  c  /n  )  -  c.  /n  * 

2  1 —  1  p  — . 

m  -2c^vn  inputs,  of  which  at  least  m  /?  -  c^vn 
are  inputs  x^  .  One  of  the  following  cases  must 
hold. 


Proof.  A  circuit  for  computing  Boolean  convolution 

is  a  shirting  circuit  if  we  regard  x.,...,x  as  the 

i.  m 

primary  inputs  and  as  the  outputs.  □ 

CoroLlary  h.  Any  planar  circuit  for  computing  the 
product  of  two  m  bit  binary  integers  has  at  least 
2 

l  m/2 j  / 16?  vertices. 


Proof.  A  circuit  for  multiplying  two  m-bit  binary 
integers  is  a  shifting  circuit.  □ 


The  last  result  of  this  section  is  an  o(m  ) 
lower  bound  on  the  size  of  any  planar  circuit  for 
multiplying  two  my®  Boolean  matrices.  We  shall 
assume  that  the  inputs  are  j  *  i  ^or  1  j  <  b 

and  the  outputs  are  z.  ^  for  1  <  i, j  <  m  .  The 
circuit  computes  Z  =  X*Y  ,  where  Z  ■  (z^)  , 

X  «  (xij^  >  Y  *  (yi j >  .  We  use  the  following 

property  of  circuits  for  multiplying  Poolean  matrices* 
called  the  matrix  concentration  property  [23,2l  ].  For 

2 

any  k  in  the  range  1  <  k  <  n  ,  any  set 
(x.  .  |  1  <  r  <  k}  of  k  inputs  from  X  ,  and  any 
rJr  ~ 

permutation  0  of  the  integers  one  through  n  ,  there 
exist  k  vertex-disjoint  paths  from 

(xi  j  1 1  < r  <  ki  to  (2i  <j(j  \  l 1 1 r  ^  ki  • 

rJr  r  'Jr 

Similarly,  for  any  k  in  the  range  1  <  k  <  n  ,  any 

set  (y.  .  |  1  <  r  <  k]  of  k  inputs  from  Y  ,  and 

rJr 

any  permutation  0  of  one  through  n  ,  there  exist  k 


vertex-disjoint  paths  from 


1  <  r  <  k]  . 


(y,  ,  1 1  <  r  <  k3  to 

Vr 


Theorm  7.  Any  planar  circuit  G  for  multiplying  two 

l 

m x m  Boolean  matrices  contains  at  least  cm  vertices, 
for  some  positive  constant  c  . 


Proof.  This  proof  is  somewhat  involved,  and  we  make 
no  attempt  to  maximize  the  constant  factor.  Suppose 
G  contains  n  vertices,  and  that  m  is  even. 

Assign  a  cost  of  l/(l«n2)  to  each  input  x .  .  and 

2 

each  input  yij  ,  a  cost  of  1/ (2m  )  to  each  output 
,  and  a  cost  of  zero  to  every  other  vertex.  There 

is  a  partition  A,  B,  C  of  the  vertices  of  0  such 
that  neither  A  nor  B  has  total  cost  exceeding 
1/2  ,  no  edge  joins  a  vertex  in  A  with  a  vertex 
in  B  i  and  C  contains  no  more  than 

2'Ji'ln/  (1  -  V2/5  )  •  vertices.  This  is  a 

corollary  of  Theorem  1;  see  [lU],  Without  loss  of 
generality,  suppose  that  B  contains  no  fever  outputs 
than  A  ,  and  that  A  contains  no  fever  inputs  x^ 

than  inputs  y^  .  Then  B  contains  at  least 
(m2  -  outputs,  vhich  contribute  at  least 


2 

Case  1.  A  contains  at  least  '5  inputs  x.  .  . 

-  2J 

Let  p  be  the  number  of  columns  of  X  which  contain 

at  least  Um/7  elements  of  A  .  Then 
2 

pm*  (m-p)(l*m/7)  >  3®  /5  >  and  p  >  m/15  .  Let  q  be 
the  number  of  columns  of  Z  which  contain  at  least 
Um/9  elements  of  B  .  Then  qm  *  (m-q)  (Um/q’'  > 

nT  ;c  -  c^/rT/2  ,  and  q  >  m/10  -  9c_^/n/(l0m)  . 

Let  k  *  minfm/15  1  m/10  -  /  (10m1' }  .  Choose 

any  k  columns  of  X  ,  each  of  which  contains  at 
least  Um/7  elements  of  A  .  Match  each  such  colunn 
of  X  with  a  column  of  Z  which  contains  at  least 
1«b/9  elements  of  B  .  For  each  pair  of  matched 

columns  x#^  ,  z#^  ,  select  a  set  of  Um/7  ♦  Um/9  - »  = 

m/t?  rows  /  such  that  is  in  A  and  is 

in  B  .  Such  a  selection  gives  a  set  of  tau/t-3 
element?  in  X r  A  and  a  set  of  km/63  elements  in 
Z  r  B  which  must  be  joined  by  km/oj  vertex-disjoint 
paths,  since  G  has  the  matrix  concentration  property. 
Each  such  path  must  contain  a  vertex  of  C  .  Thus 

km/t'3  <  c ,  which  means  either  ro?/(15*b3''  < 
c  1/fT  (i.e.,  (m2/ (15* 63c. ) )2  <  n  >  or 
m/63 (m '10  -  9c1Vn*/(10m))  <  c^*fn  (i.e., 
(mc/(9,69c1))2  <  n  ). 

Case 


A  contains  fewer  than  3®  /5  inputs  x. 


iJ 


Then  A  contains  at  least  2in2/5  -  Pc^ViT  inputs 
y.  ^  .  Let  S  be  the  set  of  m/2  columns  of  Z  which 
contain  the  most  elements  in  B  . 


2 

Subcase  2a.  S  contains  at  least  3®  /10  elements 
in  B  .  Let  p  be  the  number  of  columns  of  X  which 
contain  at  least  Um/9  elements  of  A  .  Then 
2  ^ 

pjn  *  lt(m-p)m/9  >  m  /2  -  c^  Ja  ,  and 
p  >  m/10  -  9c1Vn/(5ml  .  Let  q  be  the  number  of 

columns  of  Z  vhich  contain  at  least  Lm/7  elements 

2 

of  B  .  Then  cp  ♦  Mm/2  -  q^m/7  >  3®  /10  *  and 

q  >  m/30  .  A  proof  similar  to  that  in  Case  1  shows 
U 

that  n  >  cm  for  some  positive  constant  c  . 

2 

Subcase  2b.  S  contains  fever  than  Jm  /lo  elements 
in  B  .  Then  the  m/2  columns  of  Z  not  in  S 

contain  at  least  m2/5  -  c^'fn/Z  elements  in  B  .  Let 

q  be  the  number  of  columns  of  Z  not  in  S  vhich 
contain  at  least  m/10  elements  In  B  .  Then 

gn*  (m/2  -  q)(m/10)  >  m2/5  -  c^Vn/2  ,  and 
q  >  m/6  -  5c,  Vi)  / (9m)  .  If  0  >  q  >  m/6  -  5c.  Vn’/(9o), 

““  2  n  1 

then  (Jm  /(10c^))  >  n  .  Hence  assume  q  >  0  .  Then 

all  columns  in  S  must  contain  at  least  m/lo 
eluents  in  B  ,  and  2m/J  -  5c1'/n"/(^)  columns  of 
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Z  Bust  contain  at  least  m/10  elements  in  B  . 

Let  p  be  the  number  of  columns  of  Y  which 
contain  at  least  m/25  elements  of  A  .  Then 

pm+  (m-p)  (m/25 )  >  2m2 /5  *  2c^  •Jn  ,  and 
p  >  3m/8  -  25c1‘/iT/(12m)  . 

For  any  input  y^  «  A  and  integer  I  in  the 
range  -n+1  <  t  <  n-1  ,  call  y^  ,  I  a  match  if 

B  .  By  the  previous  computations,  there  are 
at  least  2m/3  -  5c^‘/rT/(9m)  +  3“/8  -25c^'/rT/(12m)  -  m 
•  m/25  -  95c1'/n /06m)  =  m/25  -  c^'Ja/m  columns  J 
such  that  y.j  contains  m/25  elements  of  A  and 
z,.  contains  m/10  elements  of  B  .  Each  such 
column  produces  m  /250  matches;  thus  the  total 
number  of  matches  is  at  least  m^ /6250  -  mc^'/n*/250  . 
Since  there  are  only  2m-l  values  of  /  »  sane  value 
of  l  produces  at  least  k  =  m2/ 12, 500  -  c«\/n/500 

matches.  Since  G  has  the  matrix  concentration 
property,  this  set  of  matches  corresponds  to  a  set  of 
k  elements  in  Y  0  A  and  a  set  of  k  elements  in 
ZHB  which  must  be  Joined  by  k  vertex-disjoint 
paths.  Each  such  path  must  contain  a  vertex  in  C  . 

Thus  k  <  c^'/rT,  which  means 
mU/(12,500(c1  f  c2/500))2  <  n  . 

It 

In  all  cases  n  >  cm  for  some  positive  constant 
c  .  Choosing  the  minimum  c  over  all  cases  gives  the 
theorem  for  even  m  .  The  theorem  for  odd  m  follows 
imnediately.  □ 

The  hounds  in  Theorems  5-7  and  Corollaries  2-1 
are  tight  to  within  a  constant  factor.  We  leave  the 
proof  of  this  fact  as  an  exercise. 

6.  Embedding  of  Data  Structures. 


tree  of  one  vertex,  the  image  of  v  .  Otherwise,  apply 
Corollary  1  to  find  a  partition  A ,  B  ,  C  of  the 
vertices  of  G  .  Let  v  be  any  vertex  in  C  (if  C 
is  empty,  let  v  be  any  vertex).  Embed  the  subgraph 
of  G  induced  by  AuC-(v]  in  a  binary  tree  by 

applying  the  method  recursively.  Embed  the  subgraph  of 
G  induced  by  B  in  a  binary  tree  T?  by  applying  the 

method  recursively.  Let  T  consist  of  a  root  (the 
image  of  v  )  with  two  children,  the  root  of  T^  and 

the  root  of  T„  .  Note  that  the  tree  T  constructed 
in  this  way  has  exactly  n  vertices. 

Let  h(n)  be  the  maximum  depth  of  a  tree  T  of 
n  vertices  produced  by  this  algorithm.  Then 

h(n)  <9  if  n  <  9  > 

h(nl  <  h(2n/J  +  ? i/? -./n  -  1)  <  h(29n'30' 

if  n  >  9  . 

It  follows  that  h(n)  is  0(log  n)  . 

Let  G  =  (V,E)  be  an  n- vertex  graph  to  which  the 
algorithm  is  applied,  let  G^  be  the  subgraph  of  G 

induced  by  A  U  C  ,  and  let  G~  be  the  subgraph  induced 
by  B  .  If  s(G)  =  £  (d2(0(v),0(w))  |  (v,w)  t  E]  ,  then 
s(G)  =0  if  n  =  1  ,  and 

s(G)  <  s(G1)  +  s(G2'  ♦  k  | C |  h(n)  if  n  >  1  .  This 
follows  from  the  fact  that  any  edge  of  G  not  in  C^ 
or  G2  must  be  incident  to  a  vertex  of  C  . 

If  s(n)  is  the  maximum  value  of  s(G)  for  any 
n-vertex  graph  G  ,  then 

s(l)  »  0  ; 

s(n)  <  max{s(i)  +  s(n-i)  +  ck-/n  log  n  | 

n/3  -  Z-JsJn  <  i  <  2n/3  «■  2-B-Jn  } 
if  n  >  1  ,  for  sane  positive  constant,  c  . 


Let  Gx  -  (V^Ej)  and  G2-(V2,Eg)  be 
undirected  graphs.  An  embedding  of  G^  in  Gg  is  a 
one-to-one  map  0:  -  V2  .  The  worst-case  proximity 

of  the  embedding  is  maxfdg^Ov),  0(w) )  j  (v,w)  €  E^J  , 
where  d^fx, y)  denotes  the  distance  between  x  and 
y  in  Gg  •  The  average  proximity  of  the  embedding  is 


T”T  £  {<U0(v),0(v)) 

lfcll 


|  {v,w} c  E^]  .  These  notions 


arise  in  the  following  context.  Suppose  we  wish  to 
represent  some  kind  of  data  structure  by  another  kind 
of  data  structure,  in  such  a  way  that  if  two  records 
are  logically  adjacent  in  the  first  data  structure, 
their  representations  are  close  together  in  the  second. 
We  can  model  the  data  structures  by  undirected  graphs, 
with  vertices  denoting  records  and  edges  denoting 
logical  adjacencies.  The  representation  problem  is 
then  a  graph  aabeddlng  problem  in  which  we  wish  to 
minimize  worst-case  or  average  proximity.  See 
(5,13,191  for  research  in  this  area. 


Theorem  8.  Any  planar  graph  with  maximum  degree  k 
can  be  eSbedded  in  a  binary  tree  so  that  the  average 
proximity  is  a  constant  depending  only  upon  k  . 


Proof.  Let  0  be  an  n-vertex  planar  graph.  Qnbed  G 
in  a  binary  tree  T  by  using  the  following  recursive 
procedure.  If  G  has  one  vertex  v  ,  let  T  be  the 


An  inductive  proof  shows  that  s(n)  is  0(kn)  . 

If  G  is  a  connected  n-vertex  graph  embedded  by 
the  algorithm,  then  G  contains  at  least  n-1  edges, 
and  the  average  proximity  is  0(k)  .  If  G  is  not 
connected,  embedding  each  connected  component 
separately  and  combining  the  resulting  trees 
arbitrarily  achieves  an  0(k)  average  proximity.  D 

It  is  natural  to  ask  whether  any  graph  of  bounded 
degree  can  be  embedded  in  a  binary  tree  with  0(11 
average  proximity.  (Graphs  of  unbounded  degree  cannot 
be  so  embedded;  the  star  of  Figure  2  requires  n(log  n) 
proximity.)  Such  i6  not  the  case,  and  in  fact  the 
property  of  being  embeddable  in  a  binary  tree  with 
0(1)  average  proximity  is  closely  related  to  the 
property  of  having  a  good  separator. 

To  make  this  statement  more  precise,  let  S  be  a 
class  of  graphs.  The  class  S  has  an  f(n)  -separator 
theorem  if  there  exist  constants  a  <  1  ,  &  >  0  such 
that  the  vertices  of  any  n-vertex  graph  in  S  can  be 
partitioned  into  three  sets  A  ,  B  ,  C  such  that 
|  A I ,  f  B  |  <  an  ,  |c|  <  Bf(n)  ,  and  no  vertex  in  A  is 

adjacent  1o  any  vertex'in  B  . 

Let  S  be  any  class  of  graphs  of  bounded  degree 
closed  under  the  subgraph  relation  (i.e.,  if  f  S 

and  G^  is  a  subgraph  of  G?  ,  then  G1 t  S  ).  Suppose 
S  satisfies  an  ng(n)/(log  n)2  separator  theorem  for 
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gome  ncn- decree sing  function  g(n)  .  Using  the  idea 
in  the  proof  of  Theorem  8,  it  is  not  hard  to  show  that 
any  graph  in  S  can  be  eaibedded  in  a  binary  tree  with 
0(g{n))  average  proximity.  Conversely,  suppose  any 
graph  in  a  class  S  can  be  embedded  in  a  binary  tree 
with  0(g(n))  average  proximity.  Then  S  satisfies 
an  ng(n)/log  n  separator  theorem.  In  particular,  if 
S  satisfies  no  o(n)  -separator  theorem,  then 
embedding  the  graphs  of  S  In  binary  trees  requires 
n(log  n)  average  proximity.  Erdos,  Graham,  and 
Szemerddi  [7]  have  shown  the  existence  of  a  class  of 
graphs  of  bounded  degree  having  no  o(n)  -separator 
theorem. 


7.  The  Post  Office  Problem. 


In  [11],  Knuth  mentions  the  following  problem: 
given  n  points  (post  offices)  in  the  plane; 
determine,  for  any  new  point  (bouse),  which  post 
office  it  is  nearest.  Any  preprocessing  of  the  post 
offices  is  allowed  before  the  houses  are  processed. 

2 

Shamos  [22]  gives  an  0(log  n)  -time,  0(n  )  -space 

2 

algorithm  and  an  O((log  n)  )  -time,  0(n  log  n) 

-space  algorithm.  See  also  (6].  Using  Theorem  2 
we  can  give  a  solution  which  requires  0(log  n)  time 
and  0(n)  space,  both  minimum  if  only  binary 
decisions  are  allowed. 

A  polygon  is  a  connected,  open  planar  region 
bounded  by  a  finite  set  of  line  segnents.  (For 
convenience,  we  allow  the  point  at  infinity  to  be  an 
endpoint  of  a  line  segment;  thus  a  line  is  a  line 
segment.)  A  polygon  partition  of  the  plane  is  a 
partition  of  the  plane  into  polygons  and  bounding  line 
segments.  A  trl angulation  of  the  plane  is  a  polygon 
partition,  all  of  whose  polygons  are  bounded  by  three 
line  segments.  A  trl angulation  of  a  polygon  partition 
is  a  refinement  of  the  partition  into  a  triangulation. 
TVo  polygons  in  a  polygon  partition  are  adjacent  if 
their  boundaries  share  a  line  segment.  A  set  of 
polygons  is  connected  if  any  two  polygons  in  the  set 
are  Joined  by  a  sequence  of  adjacent  polygons. 

We  shall  solve  the  following  triangle  problem: 
given  an  n-triangle  triangulation  and  a  point, 
determine  which  triangle  or  line  segment  of  the 
triangulation  contains  the  point.  The  post  office 
problem  can  be  reformulated  as  triangle  problem;  the 
set  of  points  closest  to  each  post  office  forms  a 
polygon  (22j.  we  shall  make  use  of  the  following 
lenma,  which  we  do  ncrt  prove. 

Lama  1.  Any  n-polygon  partition  has  a  refinement 
whose  total  number  of  triangles  is  bounded  by  n  plus 
the  number  of  line  segments  bounding  ncn-triangles 
plus  a  constant  (a  line  segment  bounding  two  non¬ 
triangles  counts  twice  in  this  bound). 


Merge  pairs  of  adjacent  triangles  which  are  not 


0  to  form  a  polygon  partition  PA  .  pA 


in  C, 

^  2/5 

contains  at  most  0(n  '  )  line  segments,  since  each 

such  line  segment  must  be  a  bounding  segment  of  a 
triangle  in  T  .  Find  a  triangulation  T_  of  P- 
2/x  u  u 

with  0(n  '  )  triangles,  which  exists  by  Lemma  l. 
Using  the  given  algorithm,  determine  which  trlengle  or 
line  segment  of  T-  contains  v  . 


If  v  is  in  sane  triangle  of  CQ  ,  the  problem 

is  solved.  Otherwise,  v  is  known  to  be  in  some 
connected  set  of  triangles  in  T  minus  CQ  . 

Merge  pairs  of  adjacent  triangles  which  are  not  in 

to  form  a  polygon  partition  P.  .  Since  P.  contains 

2/5  1  1 

at  most  0(n  '  )  line  segments,  there  is  a  triangula¬ 
tion  Tj  of  P^  with  0(n?^ )  triangles.  Using  the 

given  algorithm,  determine  which  trlengle  or  line 
segment  of  T^  contains  v  .  This  solves  the  problem. 

The  sets  Cj  ,  polygon  partitions  ,  and 
triangulations  T,  are  all  precumputed.  Thus  the 
time  required  by  the  algorithm  is  0(log  n  '  )  to 


discover  which  triangle  of  TA  contains  v  ,  plus 

2/5  U 

O(log  n  '  )  to  discover  which  triangle  of 

contains  v  .  The  total  time  is  thus  O(log  n)  ,  The 
total  space  is  0(|T^|a)  <  0(n1+^£^)  .  □ 


Corollary  5.  For  any  c  >  0  there  is  an  0(log  n) 

-time,  Ofn1** )  -space  algorithm  Tor  the  triangle 
problem. 


Proof.  Immediate  from  Theorem  9,  using  the  0(log  n) 
-time,  0(n2)  -space  algorithm  of  [22]  as  a  starting 
point .  □ 

Theorem  10.  There  is  an  0(log  n)  -time,  0(n)  -space 
solution  to  the  triangle  problem. 


Proof.  Let  T  be  a  triangulation  and  v  a  vertex 
for  which  the  triangle  problem  is  to  be  solved,  if  T 
contains  no  more  than  n^  triangles,  where  nQ  is  a 

sufficiently  large  constant,  determine  which  triangle 
contains  v  by  testing  v  against  each  line  segment 
bounding  a  triangle  of  T  .  Otherwise,  let  C  be  a 

»et  of  0(r triangles  whose  removal  from  T 

L/5 

leaves  no  connected  set  of  more  than  0(n  '  ) 
triangles.  Group  the  connected  sets  of  triangles  in 
T  minus  CA  into  sets  C,  ,  each  containing  within  a 

u  U/5  1 

constant  factor  of  n  '  triangles. 


We  shall  build  up  a  sequence  of  more  and  more 
complicated  (but  more  and  more  efficient)  algorithms, 
the  last  of  which  is  the  desired  one. 

Theorem  9.  Given  an  0(log  n)  -time,  0(n^+e)  -space 
algorithm  for  the  triangle  problem  with  s  >  0  ,  one 

can  construct  an  0(log  n)  -time,  0(n^+2€^)  -apace 
algorithm. 

Proof.  Let  T  be  a  triangulation  and  v  be  a 
vertex  for  which  the  triangle  problem  is  to  be  solved. 

By  Theorem  2  there  is  a  set  of  0(n2^)  triangles  CQ 
whose  rancval  from  T  leaves  no  connected  set  of  more 
than  0(n2^5)  triangles. 


Merge  pairs  of  adjacent  triangles  which  are  not 
in  CA  to  form  a  polygon  partition  P»  .  Fn 
U  x/5  u  u 

contains  at  most  0(n^'  )  line  segments.  Find  a 

triangulation  TQ  of  PQ  with  0(n?^5l  triangles. 

Using  an  0(log  n)  -time,  O(n^)  -space  algorithm, 
determine  which  triangle  of  Tq  contains  v  . 

If  v  is  seme  triangle  of  CQ  ,  the  problem  is 
solved.  Otherwise  v  is  known  to  be  in  some  set  Cj  ■ 
Merge  pairs  of  adjacent  triangles  whlcb  are  not  m  Cj 
to  form  a  polygon  partition  Fj  .  Each  line  segment 
bounding  a  non -triangular  polygon  of  Fj  must  bound  a 
triangle  of  C0  .  Thus  there  is  a  trianguln* ion  Tj 
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Of  Fj  containing  |C±  |  +  0(r?^)  triangles.  Apply 

the  algorithm  recursively  to  discover  which  triangle 
of  Tj  contains  v  .  This  solves  the  problem. 

The  sets  Ci  ,  polygon  partitions  PA  ,  and 
triangulations  are  all  precomputed.  If  t(n) 

Is  the  worst-case  time  required  by  the  algorithm  on  an 
n-triangle  triangulatian,  then 

t(n)  «  0(1'  if  n  <  nQ  , 

t(r.)  *  t(0(n^'1 ) )  +  0(log  n)  otherwise. 

Ar.  Inductive  proof  shows  that  t(n)  is  0(log  n)  if 
r.  is  chosen  sufficiently  large. 

If  s(n)  is  the  worst-case  storage  space 
required  by  the  algorithm  on  an  n-triangle  triangula- 
tior.,  then 

s (t)  -  0(1)  if  n  <  nQ  , 

s(n)  <  0(n7/10)  +  max(£  s(ni  +  O(n^) )  |  E  lij  <  n 
and  Cjn1*/^  <  n,  <  Cgn1*^} 
for  sane  positive  constants  and  cg  . 

An  Inductive  proof  shows  that  s(n)  is  0(n)  .  □ 

The  preprocessing  time  required  by  the  algorithm 
of  Theorem  10  is  0(n  log  n)  .  See  [22].  We  do  not 
advocate  this  algorithm  as  a  practical  one,  but  its 
existence  suggests  that  there  may  be  a  practical 
algorithm  with  an  0(log  n)  time  bound  and  0(n) 
space  bound. 


5.  other  Applications. 

As  illustrated  in  this  paper,  Theorem  1  and  its 
corollaries  have  many  interesting  applications,  and 
the  paper  does  not  exhaust  them.  We  have  obtained  two 
additional  results  which  require  fuller  discussion 
than  is  possible  here.  One  is  the  application  of 
Theorem  1  to  Gaussian  elimination.  George  [8]  has 

;ro:-oed  an  0(n  log  n)  -space,  0(n^2)  -time  method 
of  carrying  out  Gaussian  elimination  on  a  system  of 
equations  whose  sparsity  structure  corresponds  to  a 
Vn  x  -Jn  square  grid.  We  can  generalize  his  method  so 
that  it  applies  to  any  system  of  equations  whose 
sparsity  structure  corresponds  to  a  planar  or  almost- 
planar  graph.  Such  systems  arise  in  the  solution  of 
two-dimensional  finite- element  problems  (15).  We 
shall  discuss  this  application  in  a  subsequent  paper; 
we  hope  that  it  will  prove  of  practical,  as  well  as 
theoretical,  value. 

Another  application  involves  the  power  of  ncn- 
lererrinism  in  one-tape  Turing  machines.  We  can  prove 
that  any  non-deterministic  t(n)  -time-bounded  one- 

•-a;*  Turing  machine  can  be  simulated  by  a  t(n)r 
alternating  one-tape  Turing  machine  with  a  constant 
■  md-er  of  alternations,  where  r  <  1  is  a  suitable 
-ar.i'ant  and  t(n)  satisfies  certain  reasonable 
r*,* ri -• s  ms.  Alternation  generalizes  the  concept  of 
'  r  leterminm  and  is  discussed  in  (},  12) ,  Our 
■•rer.gther.s  Paterson's  space-efficient 
».-•*.  r.  f  one-tape  Turing  machines  (17). 
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Figure  1.  Elimination  of  a  crossover  by  use  of  three  "exclusive  or"  gates. 

Reference  (9]  contains  a  crossover  circuit  which  uses  only  "and" 
and  "not". 
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